



Volume 112, Issue 10, 2015, Pages 3098-3103

Interplay of approximate planning strategies

Author keywords

Hierarchical reinforcement learning; Memoization; Planning; Pruning

Indexed keywords

ACCURACY; ARTICLE; ARTIFICIAL INTELLIGENCE; BEHAVIORAL SCIENCE; COGNITION; CONCEPTUAL FRAMEWORK; CONTROLLED STUDY; DECISION MAKING; DECISION TREE; FEMALE; HUMAN; HUMAN EXPERIMENT; INTELLIGENCE QUOTIENT; MALE; PLANNING; PRIORITY JOURNAL; PROBABILITY; PROCESS MODEL; REINFORCEMENT; TASK PERFORMANCE; THEORETICAL STUDY; INTELLIGENCE; ORGANIZATION AND MANAGEMENT; STATISTICS;

EID: 84924325916     PISSN: 00278424     EISSN: 10916490     Source Type: Journal    
DOI: 10.1073/pnas.1414219112     Document Type: Article
Times cited: 152

References (37)
  • 1
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton RS, Precup D, Singh S (1999) Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif Intell 112:181-211.
    • (1999) Artif Intell , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 2
    • 70350566799
    • Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
    • Botvinick MM, Niv Y, Barto AG (2009) Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition 113(3):262-280.
    • (2009) Cognition , vol.113 , Issue.3 , pp. 262-280
    • Botvinick, M.M.1    Niv, Y.2    Barto, A.G.3
  • 3
    • 28044450875
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw ND, Niv Y, Dayan P (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci 8(12):1704-1711.
    • (2005) Nat Neurosci , vol.8 , Issue.12 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 4
    • 84859371025
    • Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees
    • Huys QJM, et al. (2012) Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLOS Comput Biol 8(3):e1002410.
    • (2012) PLOS Comput Biol , vol.8 , Issue.3 , pp. e1002410
    • Huys, Q.J.M.1
  • 5
    • 84859341150
    • Habits, action sequences and reinforcement learning
    • Dezfouli A, Balleine BW (2012) Habits, action sequences and reinforcement learning. Eur J Neurosci 35(7):1036-1051.
    • (2012) Eur J Neurosci , vol.35 , Issue.7 , pp. 1036-1051
    • Dezfouli, A.1    Balleine, B.W.2
  • 6
    • 84892682926
    • Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized
    • Dezfouli A, Balleine BW (2013) Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized. PLOS Comput Biol 9(12):e1003364.
    • (2013) PLOS Comput Biol , vol.9 , Issue.12 , pp. e1003364
    • Dezfouli, A.1    Balleine, B.W.2
  • 7
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich TG (2000) Hierarchical reinforcement learning with the MAXQ value function decomposition. J Artif Intell Res 13:227-303.
    • (2000) J Artif Intell Res , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 9
    • 0242497620
    • The architecture of cognitive control in the human prefrontal cortex
    • Koechlin E, Ody C, Kouneiher F (2003) The architecture of cognitive control in the human prefrontal cortex. Science 302(5648):1181-1185.
    • (2003) Science , vol.302 , Issue.5648 , pp. 1181-1185
    • Koechlin, E.1    Ody, C.2    Kouneiher, F.3
  • 10
    • 42749096312
    • Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes
    • Badre D (2008) Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes. Trends Cogn Sci 12(5):193-200.
    • (2008) Trends Cogn Sci , vol.12 , Issue.5 , pp. 193-200
    • Badre, D.1
  • 11
    • 67649342617
    • Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
    • Ostlund SB, Winterbauer NE, Balleine BW (2009) Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J Neurosci 29(25):8280-8287.
    • (2009) J Neurosci , vol.29 , Issue.25 , pp. 8280-8287
    • Ostlund, S.B.1    Winterbauer, N.E.2    Balleine, B.W.3
  • 12
    • 78049389920
    • Cognitive illusions of authorship reveal hierarchical error detection in skilled typists
    • Logan GD, Crump MJC (2010) Cognitive illusions of authorship reveal hierarchical error detection in skilled typists. Science 330(6004):683-686.
    • (2010) Science , vol.330 , Issue.6004 , pp. 683-686
    • Logan, G.D.1    Crump, M.J.C.2
  • 13
    • 0002444193
    • Memo functions and machine learning
    • Michie D (1968) Memo functions and machine learning. Nature 218:19-22.
    • (1968) Nature , vol.218 , pp. 19-22
    • Michie, D.1
  • 16
    • 84924316006
    • Compositional policy priors
    • Wingate D, Diuk C, O'Donnell T, Tenenbaum J, Gershman S (2013) Compositional policy priors. Technical Report MIT-CSAIL-TR 2013-007 (Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA).
    • (2013) Technical Report MIT-CSAIL-TR 2013-007 , Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA
    • Wingate, D.1    Diuk, C.2    O'Donnell, T.3    Tenenbaum, J.4    Gershman, S.5
  • 17
    • 84950934893
    • Bayes factors
    • Kass R, Raftery A (1995) Bayes factors. J Am Stat Assoc 90(430):773-795.
    • (1995) J Am Stat Assoc , vol.90 , Issue.430 , pp. 773-795
    • Kass, R.1    Raftery, A.2
  • 19
    • 79958143780
    • Speed/accuracy trade-off between the habitual and the goal-directed processes
    • Keramati M, Dezfouli A, Piray P (2011) Speed/accuracy trade-off between the habitual and the goal-directed processes. PLOS Comput Biol 7(5):e1002055.
    • (2011) PLOS Comput Biol , vol.7 , Issue.5 , pp. e1002055
    • Keramati, M.1    Dezfouli, A.2    Piray, P.3
  • 21
    • 84886934503
    • Goal neglect and knowledge chunking in the construction of novel behaviour
    • Bhandari A, Duncan J (2014) Goal neglect and knowledge chunking in the construction of novel behaviour. Cognition 130(1):11-30.
    • (2014) Cognition , vol.130 , Issue.1 , pp. 11-30
    • Bhandari, A.1    Duncan, J.2
  • 22
    • 0022055330
    • Evidence of hierarchies in cognitive maps
    • Hirtle SC, Jonides J (1985) Evidence of hierarchies in cognitive maps. Mem Cognit 13(3):208-217.
    • (1985) Mem Cognit , vol.13 , Issue.3 , pp. 208-217
    • Hirtle, S.C.1    Jonides, J.2
  • 24
    • 14644414684
    • 'Fine-to-coarse' route planning and navigation in regionalized environments
    • Wiener JM, Mallot HA (2003) 'Fine-to-coarse' route planning and navigation in regionalized environments. Spat Cogn Comput 3:331-358.
    • (2003) Spat Cogn Comput , vol.3 , pp. 331-358
    • Wiener, J.M.1    Mallot, H.A.2
  • 26
    • 77953537026
    • Node centrality in weighted networks: Generalizing degree and shortest paths
    • Opsahl T, Agneessens F, Skvoretz J (2010) Node centrality in weighted networks: Generalizing degree and shortest paths. Soc Networks 32:245-251.
    • (2010) Soc Networks , vol.32 , pp. 245-251
    • Opsahl, T.1    Agneessens, F.2    Skvoretz, J.3
  • 30
    • 33644782012
    • Dynamic response-by-response models of matching behavior in rhesus monkeys
    • Lau B, Glimcher PW (2005) Dynamic response-by-response models of matching behavior in rhesus monkeys. J Exp Anal Behav 84(3):555-579.
    • (2005) J Exp Anal Behav , vol.84 , Issue.3 , pp. 555-579
    • Lau, B.1    Glimcher, P.W.2
  • 31
    • 34447632392
    • Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex
    • Seo H, Barraclough DJ, Lee D (2007) Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex. Cereb Cortex 17(Suppl 1):i110-i117.
    • (2007) Cereb Cortex , vol.17 , pp. i110-i117
    • Seo, H.1    Barraclough, D.J.2    Lee, D.3
  • 32
    • 0001240712
    • Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation
    • Camerer C, Ho TH (1998) Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation. J Math Psychol 42(2/3):305-326.
    • (1998) J Math Psychol , vol.42 , Issue.2-3 , pp. 305-326
    • Camerer, C.1    Ho, T.H.2
  • 33
    • 79952746011
    • Model-based influences on humans' choices and striatal prediction errors
    • Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ (2011) Model-based influences on humans' choices and striatal prediction errors. Neuron 69(6):1204-1215.
    • (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 34
    • 72849112662
    • Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task
    • Rutledge RB, et al. (2009) Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task. J Neurosci 29(48):15104-15114.
    • (2009) J Neurosci , vol.29 , Issue.48 , pp. 15104-15114
    • Rutledge, R.B.1
  • 35
    • 84860163389
    • Serotonin selectively modulates reward value in human decision-making
    • Seymour B, Daw ND, Roiser JP, Dayan P, Dolan R (2012) Serotonin selectively modulates reward value in human decision-making. J Neurosci 32(17):5833-5842.
    • (2012) J Neurosci , vol.32 , Issue.17 , pp. 5833-5842
    • Seymour, B.1    Daw, N.D.2    Roiser, J.P.3    Dayan, P.4    Dolan, R.5
  • 36
    • 0032421570
    • The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10
    • Sheehan DV, et al. (1998) The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry 59(Suppl 20):22-33, quiz 34-57.
    • (1998) J Clin Psychiatry , vol.59 , pp. 22-33
    • Sheehan, D.V.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.