SCOPUS 정보 검색 플랫폼

Proceedings of the National Academy of Sciences of the United States of America

Volumn 112, Issue 10, 2015, Pages 3098-3103

Interplay of approximate planning strategies

(8) Huys, Quentin J M a,b Lally, Níall c,d Faulkner, Paul e Eshel, Neir f Seifritz, Erich b Gershman, Samuel J g Dayan, Peter h Roiser, Jonathan P c

a UNIVERSITY OF ZURICH (Switzerland)

b UNIVERSITY HOSPITAL OF PSYCHIATRY (Switzerland)

c UNIVERSITY COLLEGE LONDON (United Kingdom)

d NATIONAL INSTITUTE OF MENTAL HEALTH (United States)

e UNIVERSITY OF CALIFORNIA (United States)

f HARVARD MEDICAL SCHOOL (United States)

g MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

h UNIVERSITY COLLEGE LONDON (United Kingdom)

Author keywords

Hierarchical reinforcement learning; memoization; Planning; pruning

Indexed keywords

ACCURACY; ARTICLE; ARTIFICIAL INTELLIGENCE; BEHAVIORAL SCIENCE; COGNITION; CONCEPTUAL FRAMEWORK; CONTROLLED STUDY; DECISION MAKING; DECISION TREE; FEMALE; HUMAN; HUMAN EXPERIMENT; INTELLIGENCE QUOTIENT; MALE; PLANNING; PRIORITY JOURNAL; PROBABILITY; PROCESS MODEL; REINFORCEMENT; TASK PERFORMANCE; THEORETICAL STUDY; INTELLIGENCE; ORGANIZATION AND MANAGEMENT; STATISTICS;

HUMANS; INTELLIGENCE; PLANNING TECHNIQUES; STOCHASTIC PROCESSES;

EID: 84924325916 PISSN: 00278424 EISSN: 10916490 Source Type: Journal
DOI: 10.1073/pnas.1414219112 Document Type: Article

Times cited : (152)

References (37)

1
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton RS, Precup D, Singh S (1999) Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif Intell 112:181-211.
- (1999) Artif Intell , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

2
- 70350566799
- Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
- Botvinick MM, Niv Y, Barto AC (2009) Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition 113(3):262-280.
- (2009) Cognition , vol.113 , Issue.3 , pp. 262-280
- Botvinick, M.M.¹ Niv, Y.² Barto, A.C.³

3
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Daw ND, Niv Y, Dayan P (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci 8(12):1704-1711.
- (2005) Nat Neurosci , vol.8 , Issue.12 , pp. 1704-1711
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

4
- 84859371025
- Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees
- Huys QJM, et al. (2012) Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLOS Comput Biol 8(3):e1002410.
- (2012) PLOS Comput Biol , vol.8 , Issue.3 , pp. e1002410
- Huys, Q.J.M.¹

5
- 84859341150
- Habits, action sequences and reinforcement learning
- Dezfouli A, Balleine BW (2012) Habits, action sequences and reinforcement learning. Eur J Neurosci 35(7):1036-1051.
- (2012) Eur J Neurosci , vol.35 , Issue.7 , pp. 1036-1051
- Dezfouli, A.¹ Balleine, B.W.²

6
- 84892682926
- Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized
- Dezfouli A, Balleine BW (2013) Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized. PLOS Comput Biol 9(12):e1003364.
- (2013) PLOS Comput Biol , vol.9 , Issue.12 , pp. e1003364
- Dezfouli, A.¹ Balleine, B.W.²

7
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich TG (2000) Hierarchical reinforcement learning with the MAXQ value function decomposition. J Artif Intell Res 13:227-303.
- (2000) J Artif Intell Res , vol.13 , pp. 227-303
- Dietterich, T.G.¹

8
- 0033607507
- Building neural representations of habits
- Jog MS, Kubota Y, Connolly CI, Hillegaart V, Graybiel AM (1999) Building neural representations of habits. Science 286(5445):1745-1749.
- (1999) Science , vol.286 , Issue.5445 , pp. 1745-1749
- Jog, M.S.¹ Kubota, Y.² Connolly, C.I.³ Hillegaart, V.⁴ Graybiel, A.M.⁵

9
- 0242497620
- The architecture of cognitive control in the human prefrontal cortex
- Koechlin E, Ody C, Kouneiher F (2003) The architecture of cognitive control in the human prefrontal cortex. Science 302(5648):1181-1185.
- (2003) Science , vol.302 , Issue.5648 , pp. 1181-1185
- Koechlin, E.¹ Ody, C.² Kouneiher, F.³

10
- 42749096312
- Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes
- Badre D (2008) Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes. Trends Cogn Sci 12(5):193-200.
- (2008) Trends Cogn Sci , vol.12 , Issue.5 , pp. 193-200
- Badre, D.¹

11
- 67649342617
- Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
- Ostlund SB, Winterbauer NE, Balleine BW (2009) Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J Neurosci 29(25):8280-8287.
- (2009) J Neurosci , vol.29 , Issue.25 , pp. 8280-8287
- Ostlund, S.B.¹ Winterbauer, N.E.² Balleine, B.W.³

12
- 78049389920
- Cognitive illusions of authorship reveal hierarchical error detection in skilled typists
- Logan GD, Crump MJC (2010) Cognitive illusions of authorship reveal hierarchical error detection in skilled typists. Science 330(6004):683-686.
- (2010) Science , vol.330 , Issue.6004 , pp. 683-686
- Logan, G.D.¹ Crump, M.J.C.²

13
- 0002444193
- Memo functions and machine learning
- Michie D (1968) Memo functions and machine learning. Nature 218:19-22.
- (1968) Nature , vol.218 , pp. 19-22
- Michie, D.¹

14
- 80052194188
- Technical Report Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA
- O'Donnell TJ, Goodman ND, Tenenbaum JB (2009) Fragment grammars: Exploring computation and reuse in language. Technical Report MIT-CSAIL-TR-2009-013 (Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA).
- (2009) Fragment Grammars: Exploring Computation and Reuse in Language
- O'Donnell, T.J.¹ Goodman, N.D.² Tenenbaum, J.B.³

15
- 84924316007
- MIT Press, Cambridge, MA
- O'Donnell TJ (2015) Productivity and Reuse in Language: A Theory of Linguistic Computation and Storage (MIT Press, Cambridge, MA).
- (2015) Productivity and Reuse in Language: A Theory of Linguistic Computation and Storage
- O'Donnell, T.J.¹

16
- 84924316006
- Technical Report Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA
- Wingate D, Diuk C, O'Donnell T, Tenenbaum J, Gershman S (2013) Compositional policy priors. Technical Report MIT-CSAIL-TR 2013-007 (Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA).
- (2013) Compositional Policy Priors
- Wingate, D.¹ Diuk, C.² O'Donnell, T.³ Tenenbaum, J.⁴ Gershman, S.⁵

17
- 84950934893
- Bayes factors
- Kass R, Raftery A (1995) Bayes factors. J Am Stat Assoc 90(430):773-795.
- (1995) J Am Stat Assoc , vol.90 , Issue.430 , pp. 773-795
- Kass, R.¹ Raftery, A.²

18
- 33749249312
- Hierarchical Dirichlet processes
- Teh YW, Jordan MI, Beal MJ, Blei DM (2006) Hierarchical Dirichlet processes. J Am Stat Assoc 101(476):1566-1581.
- (2006) J Am Stat Assoc , vol.101 , Issue.476 , pp. 1566-1581
- Teh, Y.W.¹ Jordan, M.I.² Beal, M.J.³ Blei, D.M.⁴

19
- 79958143780
- Speed/accuracy trade-off between the habitual and the goal-directed processes
- Keramati M, Dezfouli A, Piray P (2011) Speed/accuracy trade-off between the habitual and the goal-directed processes. PLOS Comput Biol 7(5):e1002055.
- (2011) PLOS Comput Biol , vol.7 , Issue.5 , pp. e1002055
- Keramati, M.¹ Dezfouli, A.² Piray, P.³

20
- 0004102479
- MIT Press, Cambridge, MA
- Sutton RS, Barto AG (1998) Reinforcement Learning: An Introduction (MIT Press, Cambridge, MA).
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

21
- 84886934503
- Goal neglect and knowledge chunking in the construction of novel behaviour
- Bhandari A, Duncan J (2014) Goal neglect and knowledge chunking in the construction of novel behaviour. Cognition 130(1):11-30.
- (2014) Cognition , vol.130 , Issue.1 , pp. 11-30
- Bhandari, A.¹ Duncan, J.²

22
- 0022055330
- Evidence of hierarchies in cognitive maps
- Hirtle SC, Jonides J (1985) Evidence of hierarchies in cognitive maps. Mem Cognit 13(3):208-217.
- (1985) Mem Cognit , vol.13 , Issue.3 , pp. 208-217
- Hirtle, S.C.¹ Jonides, J.²

23
- 0013465187
- Automatic discovery of subgoals in reinforcement learning using diverse density
- Morgan Kaufmann, San Francisco
- McGovern A, Barto AG (2001) Automatic discovery of subgoals in reinforcement learning using diverse density. Proceedings of the Eighteenth International Conference on Machine Learning (Morgan Kaufmann, San Francisco), pp 361-368.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 361-368
- McGovern, A.¹ Barto, A.G.²

24
- 14644414684
- Fine-to-coarse' route planning and navigation in regionalized environments
- Wiener JM, Mallot HA (2003) 'Fine-to-coarse' route planning and navigation in regionalized environments. Spat Cogn Comput 3:331-358.
- (2003) Spat Cogn Comput , vol.3 , pp. 331-358
- Wiener, J.M.¹ Mallot, H.A.²

25
- 31844447221
- Identifying useful subgoals in reinforcement learning by local graph partitioning
- Assoc for Computing Machinery, New York
- Şimşek Ö, Wolfe AP, Barto AG (2005) Identifying useful subgoals in reinforcement learning by local graph partitioning. Proceedings of the 22nd International Conference on Machine Learning (Assoc for Computing Machinery, New York), pp 816-823.
- (2005) Proceedings of the 22nd International Conference on Machine Learning , pp. 816-823
- Şimşek, Ö.¹ Wolfe, A.P.² Barto, A.G.³

26
- 77953537026
- Node centrality in weighted networks: Generalizing degree and shortest paths
- Opsahl T, Agneessens F, Skvoretz J (2010) Node centrality in weighted networks: Generalizing degree and shortest paths. Soc Networks 32:245-251.
- (2010) Soc Networks , vol.32 , pp. 245-251
- Opsahl, T.¹ Agneessens, F.² Skvoretz, J.³

27
- 84875674596
- Neural representations of events arise from temporal community structure
- Schapiro AC, Rogers TT, Cordova NI, Turk-Browne NB, Botvinick MM (2013) Neural representations of events arise from temporal community structure. Nat Neurosci 16(4):486-492.
- (2013) Nat Neurosci , vol.16 , Issue.4 , pp. 486-492
- Schapiro, A.C.¹ Rogers, T.T.² Cordova, N.I.³ Turk-Browne, N.B.⁴ Botvinick, M.M.⁵

28
- 84924316005
- Decision-theoretic psychiatry
- in press
- Huys QJM, Guitart-Masip M, Dolan RJ, Dayan P (2015) Decision-theoretic psychiatry. Clin Psychol Sci, in press.
- (2015) Clin Psychol Sci
- Huys, Q.J.M.¹ Guitart-Masip, M.² Dolan, R.J.³ Dayan, P.⁴

29
- 0004049893
- PhD thesis (Cambridge Univ, Cambridge, UK)
- Watkins CJCH (1989) Learning from delayed rewards. PhD thesis (Cambridge Univ, Cambridge, UK).
- (1989) Learning From Delayed Rewards
- Watkins, C.J.C.H.¹

30
- 33644782012
- Dynamic response-by-response models of matching behavior in rhesus monkeys
- Lau B, Glimcher PW (2005) Dynamic response-by-response models of matching behavior in rhesus monkeys. J Exp Anal Behav 84(3):555-579.
- (2005) J Exp Anal Behav , vol.84 , Issue.3 , pp. 555-579
- Lau, B.¹ Glimcher, P.W.²

31
- 34447632392
- Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex
- Seo H, Barraclough DJ, Lee D (2007) Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex. Cereb Cortex 17(Suppl 1):i110-i117.
- (2007) Cereb Cortex , vol.17 , pp. i110-i117
- Seo, H.¹ Barraclough, D.J.² Lee, D.³

32
- 0001240712
- Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation
- Camerer C, Ho TH (1998) Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation. J Math Psychol 42(2/3):305-326.
- (1998) J Math Psychol , vol.42 , Issue.2-3 , pp. 305-326
- Camerer, C.¹ Ho, T.H.²

33
- 79952746011
- Model-based influences on humans' choices and striatal prediction errors
- Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ (2011) Model-based influences on humans' choices and striatal prediction errors. Neuron 69(6):1204-1215.
- (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
- Daw, N.D.¹ Gershman, S.J.² Seymour, B.³ Dayan, P.⁴ Dolan, R.J.⁵

34
- 72849112662
- Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task
- Rutledge RB, et al. (2009) Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task. J Neurosci 29(48):15104-15114.
- (2009) J Neurosci , vol.29 , Issue.48 , pp. 15104-15114
- Rutledge, R.B.¹

35
- 84860163389
- Serotonin selectively modulates reward value in human decision-making
- Seymour B, Daw ND, Roiser JP, Dayan P, Dolan R (2012) Serotonin selectively modulates reward value in human decision-making. J Neurosci 32(17):5833-5842.
- (2012) J Neurosci , vol.32 , Issue.17 , pp. 5833-5842
- Seymour, B.¹ Daw, N.D.² Roiser, J.P.³ Dayan, P.⁴ Dolan, R.⁵

36
- 0032421570
- The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10
- quiz 34-57
- Sheehan DV, et al. (1998) The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry 59(Suppl 20):22-33, quiz 34-57.
- (1998) J Clin Psychiatry , vol.59 , pp. 22-33
- Sheehan, D.V.¹

37
- 0009761719
- The Psychological Corp, San Antonio, TX
- Wechsler D (2001) Wechsler Test of Adult Reading Manual (The Psychological Corp, San Antonio, TX).
- (2001) Wechsler Test of Adult Reading Manual
- Wechsler, D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.