-
1
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Sutton RS, Precup D, Singh S (1999) Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif Intell 112:181-211.
-
(1999)
Artif Intell
, vol.112
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
2
-
-
70350566799
-
Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
-
Botvinick MM, Niv Y, Barto AC (2009) Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition 113(3):262-280.
-
(2009)
Cognition
, vol.113
, Issue.3
, pp. 262-280
-
-
Botvinick, M.M.1
Niv, Y.2
Barto, A.C.3
-
3
-
-
28044450875
-
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
-
Daw ND, Niv Y, Dayan P (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci 8(12):1704-1711.
-
(2005)
Nat Neurosci
, vol.8
, Issue.12
, pp. 1704-1711
-
-
Daw, N.D.1
Niv, Y.2
Dayan, P.3
-
4
-
-
84859371025
-
Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees
-
Huys QJM, et al. (2012) Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLOS Comput Biol 8(3):e1002410.
-
(2012)
PLOS Comput Biol
, vol.8
, Issue.3
, pp. e1002410
-
-
Huys, Q.J.M.1
-
5
-
-
84859341150
-
Habits, action sequences and reinforcement learning
-
Dezfouli A, Balleine BW (2012) Habits, action sequences and reinforcement learning. Eur J Neurosci 35(7):1036-1051.
-
(2012)
Eur J Neurosci
, vol.35
, Issue.7
, pp. 1036-1051
-
-
Dezfouli, A.1
Balleine, B.W.2
-
6
-
-
84892682926
-
Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized
-
Dezfouli A, Balleine BW (2013) Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized. PLOS Comput Biol 9(12):e1003364.
-
(2013)
PLOS Comput Biol
, vol.9
, Issue.12
, pp. e1003364
-
-
Dezfouli, A.1
Balleine, B.W.2
-
7
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich TG (2000) Hierarchical reinforcement learning with the MAXQ value function decomposition. J Artif Intell Res 13:227-303.
-
(2000)
J Artif Intell Res
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
8
-
-
0033607507
-
Building neural representations of habits
-
Jog MS, Kubota Y, Connolly CI, Hillegaart V, Graybiel AM (1999) Building neural representations of habits. Science 286(5445):1745-1749.
-
(1999)
Science
, vol.286
, Issue.5445
, pp. 1745-1749
-
-
Jog, M.S.1
Kubota, Y.2
Connolly, C.I.3
Hillegaart, V.4
Graybiel, A.M.5
-
9
-
-
0242497620
-
The architecture of cognitive control in the human prefrontal cortex
-
Koechlin E, Ody C, Kouneiher F (2003) The architecture of cognitive control in the human prefrontal cortex. Science 302(5648):1181-1185.
-
(2003)
Science
, vol.302
, Issue.5648
, pp. 1181-1185
-
-
Koechlin, E.1
Ody, C.2
Kouneiher, F.3
-
10
-
-
42749096312
-
Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes
-
Badre D (2008) Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes. Trends Cogn Sci 12(5):193-200.
-
(2008)
Trends Cogn Sci
, vol.12
, Issue.5
, pp. 193-200
-
-
Badre, D.1
-
11
-
-
67649342617
-
Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
-
Ostlund SB, Winterbauer NE, Balleine BW (2009) Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J Neurosci 29(25):8280-8287.
-
(2009)
J Neurosci
, vol.29
, Issue.25
, pp. 8280-8287
-
-
Ostlund, S.B.1
Winterbauer, N.E.2
Balleine, B.W.3
-
12
-
-
78049389920
-
Cognitive illusions of authorship reveal hierarchical error detection in skilled typists
-
Logan GD, Crump MJC (2010) Cognitive illusions of authorship reveal hierarchical error detection in skilled typists. Science 330(6004):683-686.
-
(2010)
Science
, vol.330
, Issue.6004
, pp. 683-686
-
-
Logan, G.D.1
Crump, M.J.C.2
-
13
-
-
0002444193
-
Memo functions and machine learning
-
Michie D (1968) Memo functions and machine learning. Nature 218:19-22.
-
(1968)
Nature
, vol.218
, pp. 19-22
-
-
Michie, D.1
-
14
-
-
80052194188
-
-
Technical Report Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA
-
O'Donnell TJ, Goodman ND, Tenenbaum JB (2009) Fragment grammars: Exploring computation and reuse in language. Technical Report MIT-CSAIL-TR-2009-013 (Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA).
-
(2009)
Fragment Grammars: Exploring Computation and Reuse in Language
-
-
O'Donnell, T.J.1
Goodman, N.D.2
Tenenbaum, J.B.3
-
16
-
-
84924316006
-
-
Technical Report Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA
-
Wingate D, Diuk C, O'Donnell T, Tenenbaum J, Gershman S (2013) Compositional policy priors. Technical Report MIT-CSAIL-TR 2013-007 (Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA).
-
(2013)
Compositional Policy Priors
-
-
Wingate, D.1
Diuk, C.2
O'Donnell, T.3
Tenenbaum, J.4
Gershman, S.5
-
18
-
-
33749249312
-
Hierarchical Dirichlet processes
-
Teh YW, Jordan MI, Beal MJ, Blei DM (2006) Hierarchical Dirichlet processes. J Am Stat Assoc 101(476):1566-1581.
-
(2006)
J Am Stat Assoc
, vol.101
, Issue.476
, pp. 1566-1581
-
-
Teh, Y.W.1
Jordan, M.I.2
Beal, M.J.3
Blei, D.M.4
-
19
-
-
79958143780
-
Speed/accuracy trade-off between the habitual and the goal-directed processes
-
Keramati M, Dezfouli A, Piray P (2011) Speed/accuracy trade-off between the habitual and the goal-directed processes. PLOS Comput Biol 7(5):e1002055.
-
(2011)
PLOS Comput Biol
, vol.7
, Issue.5
, pp. e1002055
-
-
Keramati, M.1
Dezfouli, A.2
Piray, P.3
-
21
-
-
84886934503
-
Goal neglect and knowledge chunking in the construction of novel behaviour
-
Bhandari A, Duncan J (2014) Goal neglect and knowledge chunking in the construction of novel behaviour. Cognition 130(1):11-30.
-
(2014)
Cognition
, vol.130
, Issue.1
, pp. 11-30
-
-
Bhandari, A.1
Duncan, J.2
-
22
-
-
0022055330
-
Evidence of hierarchies in cognitive maps
-
Hirtle SC, Jonides J (1985) Evidence of hierarchies in cognitive maps. Mem Cognit 13(3):208-217.
-
(1985)
Mem Cognit
, vol.13
, Issue.3
, pp. 208-217
-
-
Hirtle, S.C.1
Jonides, J.2
-
24
-
-
14644414684
-
Fine-to-coarse' route planning and navigation in regionalized environments
-
Wiener JM, Mallot HA (2003) 'Fine-to-coarse' route planning and navigation in regionalized environments. Spat Cogn Comput 3:331-358.
-
(2003)
Spat Cogn Comput
, vol.3
, pp. 331-358
-
-
Wiener, J.M.1
Mallot, H.A.2
-
26
-
-
77953537026
-
Node centrality in weighted networks: Generalizing degree and shortest paths
-
Opsahl T, Agneessens F, Skvoretz J (2010) Node centrality in weighted networks: Generalizing degree and shortest paths. Soc Networks 32:245-251.
-
(2010)
Soc Networks
, vol.32
, pp. 245-251
-
-
Opsahl, T.1
Agneessens, F.2
Skvoretz, J.3
-
27
-
-
84875674596
-
Neural representations of events arise from temporal community structure
-
Schapiro AC, Rogers TT, Cordova NI, Turk-Browne NB, Botvinick MM (2013) Neural representations of events arise from temporal community structure. Nat Neurosci 16(4):486-492.
-
(2013)
Nat Neurosci
, vol.16
, Issue.4
, pp. 486-492
-
-
Schapiro, A.C.1
Rogers, T.T.2
Cordova, N.I.3
Turk-Browne, N.B.4
Botvinick, M.M.5
-
30
-
-
33644782012
-
Dynamic response-by-response models of matching behavior in rhesus monkeys
-
Lau B, Glimcher PW (2005) Dynamic response-by-response models of matching behavior in rhesus monkeys. J Exp Anal Behav 84(3):555-579.
-
(2005)
J Exp Anal Behav
, vol.84
, Issue.3
, pp. 555-579
-
-
Lau, B.1
Glimcher, P.W.2
-
31
-
-
34447632392
-
Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex
-
Seo H, Barraclough DJ, Lee D (2007) Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex. Cereb Cortex 17(Suppl 1):i110-i117.
-
(2007)
Cereb Cortex
, vol.17
, pp. i110-i117
-
-
Seo, H.1
Barraclough, D.J.2
Lee, D.3
-
32
-
-
0001240712
-
Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation
-
Camerer C, Ho TH (1998) Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation. J Math Psychol 42(2/3):305-326.
-
(1998)
J Math Psychol
, vol.42
, Issue.2-3
, pp. 305-326
-
-
Camerer, C.1
Ho, T.H.2
-
33
-
-
79952746011
-
Model-based influences on humans' choices and striatal prediction errors
-
Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ (2011) Model-based influences on humans' choices and striatal prediction errors. Neuron 69(6):1204-1215.
-
(2011)
Neuron
, vol.69
, Issue.6
, pp. 1204-1215
-
-
Daw, N.D.1
Gershman, S.J.2
Seymour, B.3
Dayan, P.4
Dolan, R.J.5
-
34
-
-
72849112662
-
Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task
-
Rutledge RB, et al. (2009) Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task. J Neurosci 29(48):15104-15114.
-
(2009)
J Neurosci
, vol.29
, Issue.48
, pp. 15104-15114
-
-
Rutledge, R.B.1
-
35
-
-
84860163389
-
Serotonin selectively modulates reward value in human decision-making
-
Seymour B, Daw ND, Roiser JP, Dayan P, Dolan R (2012) Serotonin selectively modulates reward value in human decision-making. J Neurosci 32(17):5833-5842.
-
(2012)
J Neurosci
, vol.32
, Issue.17
, pp. 5833-5842
-
-
Seymour, B.1
Daw, N.D.2
Roiser, J.P.3
Dayan, P.4
Dolan, R.5
-
36
-
-
0032421570
-
The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10
-
quiz 34-57
-
Sheehan DV, et al. (1998) The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry 59(Suppl 20):22-33, quiz 34-57.
-
(1998)
J Clin Psychiatry
, vol.59
, pp. 22-33
-
-
Sheehan, D.V.1
|