메뉴 건너뛰기




Volumn 369, Issue 1655, 2014, Pages

Model-based hierarchical reinforcement learning and human action control

Author keywords

Goal directed behaviour; Hierarchy; Reinforcement learning

Indexed keywords

DECISION MAKING; HIERARCHICAL SYSTEM; HUMAN BEHAVIOR; LEARNING; NUMERICAL MODEL;

EID: 84907487070     PISSN: 09628436     EISSN: 14712970     Source Type: Journal    
DOI: 10.1098/rstb.2013.0480     Document Type: Article
Times cited : (134)

References (69)
  • 1
    • 78649966665 scopus 로고    scopus 로고
    • Dopamine in motivational control: Rewarding, aversive and alerting
    • Bromberg-Martin ES, Matsumoto M, Hikosaka O. 2010 Dopamine in motivational control: rewarding, aversive and alerting. Neuron 68, 815–834. (doi:10.1016/j.neuron.2010.11.022)
    • (2010) Neuron , vol.68 , pp. 815-834
    • Bromberg-Martin, E.S.1    Matsumoto, M.2    Hikosaka, O.3
  • 2
    • 0037057808 scopus 로고    scopus 로고
    • Reward, motivation, and reinforcement learning
    • Dayan P, Balleine BW. 2002 Reward, motivation, and reinforcement learning. Neuron 36, 285–298. (doi:10.1016/S0896-6273(02)00963-7)
    • (2002) Neuron , vol.36 , pp. 285-298
    • Dayan, P.1    Balleine, B.W.2
  • 3
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and striatal systems for behavioral control
    • Daw ND, Niv Y, Dayan P. 2005 Uncertainty-based competition between prefrontal and striatal systems for behavioral control. Nat. Neurosci. 8, 1704–1711. (doi:10.1038/nn1560)
    • (2005) Nat. Neurosci , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 4
    • 84885802926 scopus 로고    scopus 로고
    • Goals and habits in the brain
    • Dolan RJ, Dayan P. 2013 Goals and habits in the brain. Neuron 80, 312–325. (doi:10.1016/j.neuron.2013.09.007)
    • (2013) Neuron , vol.80 , pp. 312-325
    • Dolan, R.J.1    Dayan, P.2
  • 5
    • 84872761547 scopus 로고    scopus 로고
    • The ubiquity of model-based reinforcement learning
    • Doll BB, Simon DA, Daw ND. 2012 The ubiquity of model-based reinforcement learning. Curr. Opin. Neurobiol. 22, 1075–1081. (doi:10.1016/j.conb.2012.08.003)
    • (2012) Curr. Opin. Neurobiol , vol.22 , pp. 1075-1081
    • Doll, B.B.1    Simon, D.A.2    Daw, N.D.3
  • 7
    • 84859737036 scopus 로고    scopus 로고
    • Goal directed decision making as probabilistic inference: A computational framework and potential neural correlates
    • Solway A, Botvinick MM. 2012 Goal directed decision making as probabilistic inference: a computational framework and potential neural correlates. Psychol. Rev. 119, 120–154. (doi:10.1037/a0026435)
    • (2012) Psychol. Rev , vol.119 , pp. 120-154
    • Solway, A.1    Botvinick, M.M.2
  • 8
    • 84907545889 scopus 로고    scopus 로고
    • The algorithmic anatomy of model-based evaluation
    • Daw ND, Dayan P. 2014 The algorithmic anatomy of model-based evaluation. Phil. Trans. R. Soc. B 369, 20130478. (doi:10.1098/rstb.2013.0478)
    • (2014) Phil. Trans. R. Soc. B , vol.369 , pp. 20130478
    • Daw, N.D.1    Dayan, P.2
  • 9
    • 0141988716 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Barto A, Mahadevan S. 2003 Recent advances in hierarchical reinforcement learning. Discrete Event Dyn. Syst. 13, 341–379. (doi:10.1023/A:1025696116075)
    • (2003) Discrete Event Dyn. Syst , vol.13 , pp. 341-379
    • Barto, A.1    Mahadevan, S.2
  • 10
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: A reinforcement-learning perspective
    • Botvinick M., Niv Y, Barto AC. 2009 Hierarchically organized behavior and its neural foundations: a reinforcement-learning perspective. Cognition 113, 262–280. (doi:10.1016/j.cognition.2008.08.011)
    • (2009) Cognition , vol.113 , pp. 262-280
    • Botvinick, M.1    Niv, Y.2    Barto, A.C.3
  • 12
    • 84859341150 scopus 로고    scopus 로고
    • Habits, action sequences and reinforcement learning
    • Dezfouli A, Balleine BW. 2012 Habits, action sequences and reinforcement learning. Eur. J. Neurosci. 35, 1036–1051. (doi:10.1111/j.1460-9568.2012.08050.x)
    • (2012) Eur. J. Neurosci , vol.35 , pp. 1036-1051
    • Dezfouli, A.1    Balleine, B.W.2
  • 13
    • 84880660982 scopus 로고    scopus 로고
    • The expected value of control: An integrative theory of anterior cingulate cortex function
    • Shenhav A, Botvinick M., Cohen JD. 2013 The expected value of control: an integrative theory of anterior cingulate cortex function. Neuron 79, 217–240. (doi:10.1016/j.neuron.2013.07.007)
    • (2013) Neuron , vol.79 , pp. 217-240
    • Shenhav, A.1    Botvinick, M.2    Cohen, J.D.3
  • 14
    • 84856318423 scopus 로고    scopus 로고
    • Motivation of extended behaviors by anterior cingulate cortex
    • Holroyd CB, Yeung N. 2012 Motivation of extended behaviors by anterior cingulate cortex. Trends Cogn. Sci. 16, 122–128. (doi:10.1016/j.tics.2011.12.008)
    • (2012) Trends Cogn. Sci , vol.16 , pp. 122-128
    • Holroyd, C.B.1    Yeung, N.2
  • 15
    • 84857211334 scopus 로고    scopus 로고
    • Mechanisms of hierarchical reinforcement learning in corticostraital circuits 1: Computational analysis
    • Frank MJ, Badre D. 2012 Mechanisms of hierarchical reinforcement learning in corticostraital circuits 1: computational analysis. Cereb. Cortex 22, 509–526. (doi:10.1093/cercor/bhr114)
    • (2012) Cereb. Cortex , vol.22 , pp. 509-526
    • Frank, M.J.1    Badre, D.2
  • 16
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton RS, Precup D, Singh S. 1999 Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artif. Intell. 112, 181–211. (doi:10.1016/S0004-3702(99) 00052-1)
    • (1999) Artif. Intell , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 18
    • 84880688141 scopus 로고    scopus 로고
    • Multi-value-functions: Efficient automatic action hierarchies for multiple goal MDPs
    • Stockholm, Sweden, 31 July 1999, San Francisco, CA: Morgan Kaufmann
    • Moore AW, Baird L, Kaelbling L. 1999 Multi-value-functions: efficient automatic action hierarchies for multiple goal MDPs. In Proc. Int. Joint Conf. on Artificial Intelligence, Stockholm, Sweden, 31 July 1999, pp. 1316–1323. San Francisco, CA: Morgan Kaufmann.
    • (1999) Proc. Int. Joint Conf. on Artificial Intelligence , pp. 1316-1323
    • Moore, A.W.1    Baird, L.2    Kaelbling, L.3
  • 20
    • 84878190351 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning and decision making
    • Botvinick MM. 2012 Hierarchical reinforcement learning and decision making. Curr. Opin. Neurobiol. 22, 956–962. (doi:10.1016/j.conb.2012.05.008)
    • (2012) Curr. Opin. Neurobiol , vol.22 , pp. 956-962
    • Botvinick, M.M.1
  • 21
    • 84877341847 scopus 로고    scopus 로고
    • The curse of planning: Dissecting multiple reinforcement-learning systems by taxing the central executive
    • Otto AR, Gershman SJ, Markman AB, Daw ND. 2013 The curse of planning: dissecting multiple reinforcement-learning systems by taxing the central executive. Psychol. Sci. 24, 751–761. (doi:10.1177/0956797612463080)
    • (2013) Psychol. Sci , vol.24 , pp. 751-761
    • Otto, A.R.1    Gershman, S.J.2    Markman, A.B.3    Daw, N.D.4
  • 22
    • 0003506152 scopus 로고    scopus 로고
    • State abstraction in MAXQ hierarchical reinforcement learning
    • Colorado, 28 November 2000, Cambridge, MA: MIT Press
    • Dietterich TG. 2000 State abstraction in MAXQ hierarchical reinforcement learning. In Advances in Neural Information Processing, Denver, Colorado, 28 November 2000, pp. 994–1000. Cambridge, MA: MIT Press.
    • (2000) Advances in Neural Information Processing, Denver , pp. 994-1000
    • Dietterich, T.G.1
  • 24
    • 84871698013 scopus 로고    scopus 로고
    • Hierarchical task and motion planning in the now
    • Shanghai, China, 9 May 2011, Piscataway, NJ: IEEE Press
    • Kaelbling L., Lozano-Pérez T. 2011 Hierarchical task and motion planning in the now. In IEEE Int. Conf. on Robotics and Automation, Shanghai, China, 9 May 2011, pp. 1470–1477. Piscataway, NJ: IEEE Press.
    • (2011) IEEE Int. Conf. on Robotics and Automation , pp. 1470-1477
    • Kaelbling, L.1    Lozano-Pérez, T.2
  • 27
    • 84875468581 scopus 로고    scopus 로고
    • Two simultaneous, but separable, prediction errors in human ventral striatum
    • Diuk C, Tsai K, Wallis J, Botvinick M, Niv Y. 2013 Two simultaneous, but separable, prediction errors in human ventral striatum. J. Neurosci. 33, 5797–5805. (doi:10.1523/JNEUROSCI.5445-12.2013)
    • (2013) J. Neurosci , vol.33 , pp. 5797-5805
    • Diuk, C.1    Tsai, K.2    Wallis, J.3    Botvinick, M.4    Niv, Y.5
  • 29
    • 0023422739 scopus 로고
    • SOAR: An architecture for general intelligence
    • Laird JE, Newell A, Rosenbloom PS. 1987 SOAR: an architecture for general intelligence. Artif. Intell. 33, 1–64. (doi:10.1016/0004-3702(87)90050-6)
    • (1987) Artif. Intell , vol.33 , pp. 1-64
    • Laird, J.E.1    Newell, A.2    Rosenbloom, P.S.3
  • 30
    • 79951476983 scopus 로고    scopus 로고
    • Hierarchical control of cognitive processes: The case for skilled typewriting
    • (ed. BH Ross, New York, NY: Academic Press
    • Logan GD, Crump MJC. 2011 Hierarchical control of cognitive processes: the case for skilled typewriting. In The psychology of learning and motivation: advances in research and theory (ed. BH Ross), pp. 2–19. New York, NY: Academic Press.
    • (2011) The psychology of learning and motivation: advances in research and theory , pp. 2-19
    • Logan, G.D.1    Crump, J.C.2
  • 32
    • 1942443210 scopus 로고    scopus 로고
    • Doing without schema hierarchies: A recurrent connectionist approach to normal and impaired routine sequential action
    • Botvinick M, Plaut DC. 2004 Doing without schema hierarchies: a recurrent connectionist approach to normal and impaired routine sequential action. Psychol. Rev. 111, 395–429. (doi:10.1037/0033-295X.111.2.395)
    • (2004) Psychol. Rev , vol.111 , pp. 395-429
    • Botvinick, M.1    Plaut, D.C.2
  • 33
    • 0034075310 scopus 로고    scopus 로고
    • Contention scheduling and the control of routine activities. Cogn
    • Cooper R, Shallice T. 2000 Contention scheduling and the control of routine activities. Cogn. Neuropsychol. 17, 297–338. (doi:10.1080/026432900380427)
    • (2000) Neuropsychol , vol.17 , pp. 297-338
    • Cooper, R.1    Shallice, T.2
  • 34
    • 33750296630 scopus 로고    scopus 로고
    • Such stuff as habits are made on: A reply to Cooper and Shallice (2006
    • Botvinick M, Plaut DC. 2006 Such stuff as habits are made on: a reply to Cooper and Shallice (2006). Psychol. Rev. 113, 917–928. (doi:10.1037/0033-295X.113.4.917)
    • (2006) Psychol. Rev , vol.113 , pp. 917-928
    • Botvinick, M.1    Plaut, D.C.2
  • 36
    • 0004223940 scopus 로고
    • Cambridge, UK: Cambridge University Press
    • Reason JT. 1992 Human error. Cambridge, UK: Cambridge University Press.
    • (1992) Human error
    • Reason, J.T.1
  • 38
    • 0016069798 scopus 로고
    • Planning in a hierarchy of abstraction spaces
    • Sacerdoti ED. 1974 Planning in a hierarchy of abstraction spaces. Artif. Intell. 5, 115–135. (doi:10.1016/0004-3702(74)90026-5)
    • (1974) Artif. Intell , vol.5 , pp. 115-135
    • Sacerdoti, E.D.1
  • 39
    • 0018594651 scopus 로고
    • A cognitive model of planning
    • Hayes-Roth B, Hayes-Roth F. 1979 A cognitive model of planning. Cogn. Sci. 3, 275–310. (doi:10.1207/s15516709cog0304_1)
    • (1979) Cogn. Sci , vol.3 , pp. 275-310
    • Hayes-Roth, B.1    Hayes-Roth, F.2
  • 42
    • 84892682926 scopus 로고    scopus 로고
    • Actions, action sequences and decision-making: Evidence that goal-directed and habitual action control are hierarchically organized
    • Dezfouli A, Balleine BW. 2013 Actions, action sequences and decision-making: evidence that goal-directed and habitual action control are hierarchically organized. PLoS Comput. Biol. 9, e1003364. (doi:10.1371/journal.pcbi.1003364)
    • (2013) PLoS Comput. Biol , vol.9
    • Dezfouli, A.1    Balleine, B.W.2
  • 43
    • 84907480610 scopus 로고    scopus 로고
    • Habits as action sequences: Hierarchical action control and changes in outcome value
    • Dezfouli A, Lingawi NW, Balleine BW. 2014 Habits as action sequences: hierarchical action control and changes in outcome value. Phil. Trans. R. Soc. B 369, 20130482. (doi:10.1098/rstb.2013.0482)
    • (2014) Phil. Trans. R. Soc. B , vol.369 , pp. 20130482
    • Dezfouli, A.1    Lingawi, N.W.2    Balleine, B.W.3
  • 44
    • 67649342617 scopus 로고    scopus 로고
    • Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
    • Ostlund SB, Winterbauer NE, Balleine BW. 2009 Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J. Neurosci. 29, 8280–8287. (doi:10.1523/JNEUROSCI.1176-09.2009)
    • (2009) J. Neurosci , vol.29 , pp. 8280-8287
    • Ostlund, S.B.1    Winterbauer, N.E.2    Balleine, B.W.3
  • 45
    • 33746272406 scopus 로고
    • Duration neglect in retrospective evaluations of affective episodes
    • Fredrickson BL, Kahneman D. 1993 Duration neglect in retrospective evaluations of affective episodes. J. Pers. Soc. Psychol. 65, 45–55. (doi:10.1037/0022-3514.65.1.45)
    • (1993) J. Pers. Soc. Psychol , vol.65 , pp. 45-55
    • Fredrickson, B.L.1    Kahneman, D.2
  • 46
    • 0039786967 scopus 로고    scopus 로고
    • Gestalt characteristics of experiences: The defining features of summarized events
    • Ariely D, Carmon Z. 2000 Gestalt characteristics of experiences: the defining features of summarized events. J. Behav. Decis. Making 13, 191–201. (doi:10.1002/(SICI)1099-0771(200004/06)13: 2,191::AID-BDM330.3.0.CO;2-A)
    • (2000) J. Behav. Decis. Making , vol.13 , pp. 191-201
    • Ariely, D.1    Carmon, Z.2
  • 47
    • 84879603032 scopus 로고    scopus 로고
    • Hedonic evaluation over short and long retention intervals: The mechanism of the peak–end rule
    • Geng X, Chen Z, Lam W, Zheng Q. 2013 Hedonic evaluation over short and long retention intervals: the mechanism of the peak–end rule. J. Behav. Decis. Making 26, 225–236. (doi:10.1002/bdm.1755)
    • (2013) J. Behav. Decis. Making , vol.26 , pp. 225-236
    • Geng, X.1    Chen, Z.2    Lam, W.3    Zheng, Q.4
  • 48
    • 33747688922 scopus 로고    scopus 로고
    • Optimal predictions in everyday cognition
    • Griffiths TL, Tenenbaum JB. 2006 Optimal predictions in everyday cognition. Psychol. Sci. 17, 767–773. (doi:10.1111/j.1467-9280.2006.01780.x)
    • (2006) Psychol. Sci , vol.17 , pp. 767-773
    • Griffiths, T.L.1    Tenenbaum, J.B.2
  • 49
    • 82855178982 scopus 로고    scopus 로고
    • Predicting the future as Bayesian Inference: People combine prior knowledge with observations when estimating duration and extent
    • Griffiths TL, Tenenbaum JB. 2011 Predicting the future as Bayesian Inference: people combine prior knowledge with observations when estimating duration and extent. J. Exp. Psychol. Gen. 140, 725–743. (doi:10.1037/a0024899)
    • (2011) J. Exp. Psychol. Gen , vol.140 , pp. 725-743
    • Griffiths, T.L.1    Tenenbaum, J.B.2
  • 50
    • 26844492851 scopus 로고    scopus 로고
    • Underestimating the duration of future events: Memory incorrectly used or memory bias?
    • Roy MM, Christenfeld NJ, McKenzie CRM. 2005 Underestimating the duration of future events: memory incorrectly used or memory bias? Psychol. Bull. 131, 738–756. (doi:10.1037/0033-2909.131.5.738)
    • (2005) Psychol. Bull , vol.131 , pp. 738-756
    • Roy, M.M.1    Christenfeld, N.J.2    McKenzie, R.M.3
  • 52
    • 14644414684 scopus 로고    scopus 로고
    • ‘Fine-to-coarse’ route planning and navigation in regionalized environments
    • Wiener JM, Mallot HA. 2003 ‘Fine-to-coarse’ route planning and navigation in regionalized environments. Spat. Cogn. Comput. 3, 331–358. (doi:10.1207/s15427633scc0304_5)
    • (2003) Spat. Cogn. Comput , vol.3 , pp. 331-358
    • Wiener, J.M.1    Mallot, H.A.2
  • 53
    • 33750705246 scopus 로고    scopus 로고
    • Casual graph based decomposition of factored MDPs
    • Jonsson A, Barto AG. 2006 Casual graph based decomposition of factored MDPs. J. Mach. Learn. Res. 7, 2259–2301.
    • (2006) J. Mach. Learn. Res , vol.7 , pp. 2259-2301
    • Jonsson, A.1    Barto, A.G.2
  • 54
    • 80054969173 scopus 로고    scopus 로고
    • Intrinsically motivated hieararchical skill learning in structured environments
    • Vigorito CM, Barto AG. 2010 Intrinsically motivated hieararchical skill learning in structured environments. IEEE Trans. Auton. Ment. Dev. (T-AMD) 2, 83–90. (doi:10.1109/TAMD.2010.2051436)
    • (2010) IEEE Trans. Auton. Ment. Dev. (T-AMD) , vol.2 , pp. 83-90
    • Vigorito, C.M.1    Barto, A.G.2
  • 55
    • 0013465036 scopus 로고    scopus 로고
    • Discovering hierarchy in reinforcement learning with HEXQ
    • Hengst B. 2002 Discovering hierarchy in reinforcement learning with HEXQ. Proc. Int. Conf. Mach. Learn. 19, 243–250.
    • (2002) Proc. Int. Conf. Mach. Learn , vol.19 , pp. 243-250
    • Hengst, B.1
  • 56
    • 84858634841 scopus 로고    scopus 로고
    • Autonomous learning of high-level states and actions in continuous environments
    • Mugan J, Kuipers B. 2012 Autonomous learning of high-level states and actions in continuous environments. IEEE Trans. Auton. Ment. Dev. 4, 70–86. (doi:10.1109/TAMD.2011.2160943)
    • (2012) IEEE Trans. Auton. Ment. Dev , vol.4 , pp. 70-86
    • Mugan, J.1    Kuipers, B.2
  • 57
    • 84883172973 scopus 로고    scopus 로고
    • Computational models of executive control: Charted territory and new frontiers
    • In press
    • Botvinick M, Cohen JD. In press. Computational models of executive control: charted territory and new frontiers. Cogn. Sci.
    • Cogn. Sci
    • Botvinick, M.1    Cohen, J.D.2
  • 58
    • 84875674596 scopus 로고    scopus 로고
    • Neural representations of events arise from temporal community structure
    • Schapiro A, Cordova N, Turk-Browne N, Rogers TT, Botvinick MM. 2013 Neural representations of events arise from temporal community structure. Nat. Neurosci. 16, 486–492. (doi:10.1038/nn.3331)
    • (2013) Nat. Neurosci , vol.16 , pp. 486-492
    • Schapiro, A.1    Cordova, N.2    Turk-Browne, N.3    Rogers, T.T.4    Botvinick, M.M.5
  • 59
    • 0001158047 scopus 로고
    • Improving generalization for temporal difference learning: The successor representation
    • Dayan P. 1993 Improving generalization for temporal difference learning: the successor representation. Neural Comput. 5, 613–624. (doi:10.1162/neco.1993.5.4.613)
    • (1993) Neural Comput , vol.5 , pp. 613-624
    • Dayan, P.1
  • 60
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Glascher J, Daw N, Dayan P, O’Doherty JP. 2010 States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66, 585–595. (doi:10.1016/j.neuron.2010.04.016)
    • (2010) Neuron , vol.66 , pp. 585-595
    • Glascher, J.1    Daw, N.2    Dayan, P.3    O’doherty, J.P.4
  • 61
    • 42749096312 scopus 로고    scopus 로고
    • Cognitive control, hierarchy, and the rostro–caudal organization of the frontal lobes
    • Badre D. 2008 Cognitive control, hierarchy, and the rostro–caudal organization of the frontal lobes. Trends Cogn. Sci. 12, 193–200. (doi:10.1016/j.tics.2008.02.004)
    • (2008) Trends Cogn. Sci , vol.12 , pp. 193-200
    • Badre, D.1
  • 62
    • 0242497620 scopus 로고    scopus 로고
    • The architecture of cognitive control in the human prefrontal cortex
    • Koechlin E, Ody C, Kouneiher F. 2003 The architecture of cognitive control in the human prefrontal cortex. Science 302, 1181–1185. (doi:10.1126/science.1088545)
    • (2003) Science , vol.302 , pp. 1181-1185
    • Koechlin, E.1    Ody, C.2    Kouneiher, F.3
  • 63
    • 84896322734 scopus 로고    scopus 로고
    • Task difficulty manipulation reveals multiple demand activity but no frontal lobe hierarchy
    • Crittenden BM, Duncan J. 2012 Task difficulty manipulation reveals multiple demand activity but no frontal lobe hierarchy. Cereb. Cortex 24, 532–540. (doi:10.1093/cercor/bhs333)
    • (2012) Cereb. Cortex , vol.24 , pp. 532-540
    • Crittenden, B.M.1    Duncan, J.2
  • 64
    • 84906983504 scopus 로고    scopus 로고
    • Prefrontal cortex organization: Dissociating effects of temporal abstraction, relational abstraction, and integration with fMRI
    • Nee DE, Jahn A, Brown JW. 2013 Prefrontal cortex organization: dissociating effects of temporal abstraction, relational abstraction, and integration with fMRI. Cereb. Cortex 24, 2377–2387. (doi:10.1093/cercor/bht091)
    • (2013) Cereb. Cortex , vol.24 , pp. 2377-2387
    • Nee, D.E.1    Jahn, A.2    Brown, J.W.3
  • 65
    • 84857065417 scopus 로고    scopus 로고
    • The function and organization of lateral prefrontal cortex: A test of competing hypotheses
    • Reynolds JR, O’Reilly RC, Cohen JD, Braver TS. 2012 The function and organization of lateral prefrontal cortex: a test of competing hypotheses. PLoS ONE 7, e30284. (doi:10.1371/journal.pone.0030284)
    • (2012) PLoS ONE , vol.7
    • Reynolds, J.R.1    O’reilly, R.C.2    Cohen, J.D.3    Braver, T.S.4
  • 66
    • 33846607753 scopus 로고    scopus 로고
    • Self-projection and the brain
    • Buckner RL, Carroll DC. 2006 Self-projection and the brain. Trends Cogn. Sci. 11, 49–57. (doi:10.1016/j.tics.2006.11.004)
    • (2006) Trends Cogn. Sci , vol.11 , pp. 49-57
    • Buckner, R.L.1    Carroll, D.C.2
  • 68
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Austin, Texas, 21 June 1990, San Francisco, CA: Morgan Kaufmann
    • Sutton RS. 1990 Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proc. Seventh International Conference on Machine Learning, Austin, Texas, 21 June 1990, pp. 216–224. San Francisco, CA: Morgan Kaufmann.
    • (1990) Proc. Seventh International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1
  • 69
    • 38149106939 scopus 로고    scopus 로고
    • A biologically plausible model of human planning based on neural networks and Dyna-PI models
    • Baldassarre G. 2002 A biologically plausible model of human planning based on neural networks and Dyna-PI models. In Workshop on Adaptive Behaviour in Anticipatory Learning Systems, pp. 40–60.
    • (2002) Workshop on Adaptive Behaviour in Anticipatory Learning Systems , pp. 40-60
    • Baldassarre, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.