메뉴 건너뛰기




Volumn 12, Issue 104, 2015, Pages

Divide et impera: Subgoaling reduces the complexity of probabilistic inference and problem solving

Author keywords

Active inference; Hierarchies; Model based reinforcement learning; Planning as inference; Problem solving; Subgoals

Indexed keywords

REINFORCEMENT LEARNING; SOCIAL NETWORKING (ONLINE);

EID: 84923247065     PISSN: 17425689     EISSN: 17425662     Source Type: Journal    
DOI: 10.1098/rsif.2014.1335     Document Type: Article
Times cited : (56)

References (55)
  • 3
    • 0034928713 scopus 로고    scopus 로고
    • An integrative theory of prefrontal cortex function
    • Miller EK, Cohen JD. 2001 An integrative theory of prefrontal cortex function. Annu. Rev. Neurosci. 24, 167-202. (doi:10. 1146/annurev. neuro. 24. 1. 167)
    • (2001) Annu. Rev. Neurosci. , vol.24 , pp. 167-202
    • Miller, E.K.1    Cohen, J.D.2
  • 5
    • 67449113542 scopus 로고    scopus 로고
    • Thinking as the control of imagination: A conceptual framework for goal-directed systems
    • Pezzulo G, Castelfranchi C. 2009 Thinking as the control of imagination: a conceptual framework for goal-directed systems. Psychol. Res. 73, 559-577. (doi:10. 1007/s00426-009-0237-z)
    • (2009) Psychol. Res. , vol.73 , pp. 559-577
    • Pezzulo, G.1    Castelfranchi, C.2
  • 6
    • 33646431689 scopus 로고    scopus 로고
    • Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans
    • Mushiake H, Saito N, Sakamoto K, Itoyama Y, Tanji J. 2006 Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans. Neuron 50, 631-641. (doi:10. 1016/j. neuron. 2006. 03. 045)
    • (2006) Neuron , vol.50 , pp. 631-641
    • Mushiake, H.1    Saito, N.2    Sakamoto, K.3    Itoyama, Y.4    Tanji, J.5
  • 7
    • 25144449580 scopus 로고    scopus 로고
    • Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period
    • Saito N, Mushiake H, Sakamoto K, Itoyama Y, Tanji J. 2005 Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period. Cereb. Cortex 15, 1535-1546. (doi:10. 1093/cercor/bhi032)
    • (2005) Cereb. Cortex , vol.15 , pp. 1535-1546
    • Saito, N.1    Mushiake, H.2    Sakamoto, K.3    Itoyama, Y.4    Tanji, J.5
  • 8
    • 0020490787 scopus 로고
    • Specific impairments of planning
    • Shallice T. 1982 Specific impairments of planning. Phil. Trans. R. Soc. Lond. B 298, 199-209. (doi:10. 1098/rstb. 1982. 0082)
    • (1982) Phil. Trans. R. Soc. Lond. B , vol.298 , pp. 199-209
    • Shallice, T.1
  • 9
    • 43049099970 scopus 로고    scopus 로고
    • Hierarchical models of behavior and prefrontal function
    • Botvinick MM. 2008 Hierarchical models of behavior and prefrontal function. Trends Cogn. Sci. 12, 201-208. (doi:10. 1016/j. tics. 2008. 02. 009)
    • (2008) Trends Cogn. Sci. , vol.12 , pp. 201-208
    • Botvinick, M.M.1
  • 11
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton RS, Precup D, Singh S. 1999 Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artif. Intell. 112, 181-211. (doi:10. 1016/S0004-3702(99) 00052-1)
    • (1999) Artif. Intell. , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 12
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Barto AG, Mahadevan S. 2003 Recent advances in hierarchical reinforcement learning. Discr. Event Dyn. Syst. 13, 341-379. (doi:10. 1023/A:102569 6116075)
    • (2003) Discr. Event Dyn. Syst. , vol.13 , pp. 341-379
    • Barto, A.G.1    Mahadevan, S.2
  • 13
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
    • Botvinick M, Niv Y, Barto A. 2009 Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition 119, 262-280. (doi:10. 1016/j. cognition. 2008. 08. 011)
    • (2009) Cognition , vol.119 , pp. 262-280
    • Botvinick, M.1    Niv, Y.2    Barto, A.3
  • 14
    • 79959957608 scopus 로고    scopus 로고
    • Hierarchical behaviours: Getting the most bang for your bit
    • Darwin Meets von Neumann. Berlin, Germany: Springer
    • van Dijk SG, Polani D, Nehaniv CL. 2011 Hierarchical behaviours: getting the most bang for your bit. In Advances in artificial life. Darwin Meets von Neumann, pp. 342-349. Berlin, Germany: Springer.
    • (2011) Advances in Artificial Life , pp. 342-349
    • Van Dijk, S.G.1    Polani, D.2    Nehaniv, C.L.3
  • 17
    • 84878786771 scopus 로고    scopus 로고
    • Width and serialization of classical planning problems
    • Montpellier, France, 27-31 August. Clifton, VA: IOS Press
    • Lipovetzky N, Geffner H. 2012 Width and serialization of classical planning problems. In ECAI 2012: 20th European Conf. on Artificial Intelligence, Montpellier, France, 27-31 August, pp. 540-545. Clifton, VA: IOS Press. (doi:10. 3233/978-1-61499-098-7-540)
    • (2012) ECAI 2012: 20th European Conf. on Artificial Intelligence , pp. 540-545
    • Lipovetzky, N.1    Geffner, H.2
  • 18
    • 84871869326 scopus 로고    scopus 로고
    • Programming in the brain: A neural network theoretical framework
    • Donnarumma F, Prevete R, Trautteur G. 2012 Programming in the brain: a neural network theoretical framework. Connect. Sci. 24, 71-90. (doi:10. 1080/09540091. 2012. 684670)
    • (2012) Connect. Sci. , vol.24 , pp. 71-90
    • Donnarumma, F.1    Prevete, R.2    Trautteur, G.3
  • 19
    • 0031194381 scopus 로고    scopus 로고
    • Discovering neural nets with low Kolmogorov complexity and high generalization capability
    • Schmidhuber J. 1997 Discovering neural nets with low Kolmogorov complexity and high generalization capability. Neural Netw. 10, 10-15. (doi:10. 1016/ S0893-6080(96)00127-X)
    • (1997) Neural Netw. , vol.10 , pp. 10-15
    • Schmidhuber, J.1
  • 20
    • 0141657088 scopus 로고    scopus 로고
    • Performance in planning: Processes, requirements, and errors
    • Mumford MD, Schultz RA, Van Doorn JR. 2001 Performance in planning: processes, requirements, and errors. Rev. Gen. Psychol. 5, 213-240. (doi:10. 1037/1089-2680. 5. 3. 213)
    • (2001) Rev. Gen. Psychol. , vol.5 , pp. 213-240
    • Mumford, M.D.1    Schultz, R.A.2    Van Doorn, J.R.3
  • 21
    • 0005822655 scopus 로고
    • Subgoal length versus full solution length in predicting Tower of Hanoi problem-solving performance
    • Spitz HH, Minsky SK, Bessellieu CL. 1984 Subgoal length versus full solution length in predicting Tower of Hanoi problem-solving performance. Bull. Psychon. Soc. 22, 301-304. (doi:10. 3758/ BF03333826)
    • (1984) Bull. Psychon. Soc. , vol.22 , pp. 301-304
    • Spitz, H.H.1    Minsky, S.K.2    Bessellieu, C.L.3
  • 22
    • 79955709936 scopus 로고    scopus 로고
    • Neural correlates of forward planning in a spatial decision task in humans
    • Simon DA, Daw ND. 2011 Neural correlates of forward planning in a spatial decision task in humans. J. Neurosci. 31, 5526-5539. (doi:10. 1523/ JNEUROSCI. 4647-10. 2011)
    • (2011) J. Neurosci. , vol.31 , pp. 5526-5539
    • Simon, D.A.1    Daw, N.D.2
  • 24
    • 33749242151 scopus 로고    scopus 로고
    • Planning by probabilistic inference
    • Key West, Florida, 3-6 January. New Jersey: Society for Artificial Intelligence and Statistics
    • Attias H. 2003 Planning by probabilistic inference. In Proc. 9th Int. Workshop on Artificial Intelligence and Statistics, Key West, Florida, 3-6 January. New Jersey: Society for Artificial Intelligence and Statistics.
    • (2003) Proc. 9th Int. Workshop on Artificial Intelligence and Statistics
    • Attias, H.1
  • 25
    • 84866531311 scopus 로고    scopus 로고
    • Planning as inference
    • Botvinick M, Toussaint M. 2012 Planning as inference. Trends Cogn. Sci. 16, 485-488. (doi:10. 1016/j. tics. 2012. 08. 006)
    • (2012) Trends Cogn. Sci. , vol.16 , pp. 485-488
    • Botvinick, M.1    Toussaint, M.2
  • 26
    • 84878783112 scopus 로고    scopus 로고
    • The mixed instrumental controller: Using value of information to combine habitual choice and mental simulation
    • Pezzulo G, Rigoli F, Chersi F. 2013 The mixed instrumental controller: using value of information to combine habitual choice and mental simulation. Front. Psychol. 4, 92. (doi:10. 3389/fpsyg. 2013. 00092)
    • (2013) Front. Psychol. , vol.4 , pp. 92
    • Pezzulo, G.1    Rigoli, F.2    Chersi, F.3
  • 27
    • 33749234798 scopus 로고    scopus 로고
    • Probabilistic inference for solving discrete and continuous state Markov decision processes
    • Pittsburgh, Pennsylvania, 25-29 June. New York, NY: ACM
    • Toussaint M, Storkey A. 2006 Probabilistic inference for solving discrete and continuous state Markov decision processes. In Proc. 23rd Int. Conf. on Machine learning, Pittsburgh, Pennsylvania, 25-29 June, pp. 945-952. New York, NY: ACM.
    • (2006) Proc. 23rd Int. Conf. on Machine Learning , pp. 945-952
    • Toussaint, M.1    Storkey, A.2
  • 28
    • 85161968427 scopus 로고    scopus 로고
    • Goal-directed decision making in prefrontal cortex: A computational framework
    • Vancouver, Canada, 8-11 December. Cambridge, MA: MIT Press
    • Botvinick MM, An J. 2008 Goal-directed decision making in prefrontal cortex: a computational framework. In Advances in Neural Information Processing Systems (NIPS), Vancouver, Canada, 8-11 December. Cambridge, MA: MIT Press.
    • (2008) Advances in Neural Information Processing Systems (NIPS)
    • Botvinick, M.M.1    An, J.2
  • 29
    • 84859737036 scopus 로고    scopus 로고
    • Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates
    • Solway A, Botvinick MM. 2012 Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates. Psychol. Rev. 119, 120-154. (doi:10. 1037/a0026435)
    • (2012) Psychol. Rev. , vol.119 , pp. 120-154
    • Solway, A.1    Botvinick, M.M.2
  • 30
    • 68149131857 scopus 로고    scopus 로고
    • Reinforcement learning or active inference
    • Friston KJ, Daunizeau J, Kiebel SJ. 2009 Reinforcement learning or active inference? PLoS ONE 4, e6421. (doi:10. 1371/journal. pone. 0006421)
    • (2009) PLoS ONE , vol.4 , pp. e6421
    • Friston, K.J.1    Daunizeau, J.2    Kiebel, S.J.3
  • 33
    • 34250613841 scopus 로고    scopus 로고
    • Planning and acting in uncertain environments using probabilistic inference
    • Beijing, China, 9-15 October Piscataway, NJ: IEEE
    • Verma D, Rao RPN. 2006 Planning and acting in uncertain environments using probabilistic inference. In IROS, Beijing, China, 9-15 October, pp. 2382-2387. Piscataway, NJ: IEEE.
    • (2006) IROS , pp. 2382-2387
    • Verma, D.1    Rao, R.P.N.2
  • 34
    • 0031212924 scopus 로고    scopus 로고
    • The discovery of algorithmic probability
    • Solomonoff RJ. 1997 The discovery of algorithmic probability. J. Comp. Syst. Sci. 55, 73-88. (doi:10. 1006/jcss. 1997. 1500)
    • (1997) J. Comp. Syst. Sci. , vol.55 , pp. 73-88
    • Solomonoff, R.J.1
  • 36
    • 0001460136 scopus 로고    scopus 로고
    • On sequential Monte Carlo sampling methods for Bayesian filtering
    • Doucet A, Godsill S, Andrieu C. 2000 On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput. 10, 197-208. (doi:10. 1023/ A:1008935410038)
    • (2000) Stat. Comput. , vol.10 , pp. 197-208
    • Doucet, A.1    Godsill, S.2    Andrieu, C.3
  • 37
    • 84879205615 scopus 로고    scopus 로고
    • Computational models of planning
    • Geffner H. 2013 Computational models of planning. Wiley Interdiscip. Rev. Cogn. Sci. 4, 341-356. (doi:10. 1002/wcs. 1233)
    • (2013) Wiley Interdiscip. Rev. Cogn. Sci. , vol.4 , pp. 341-356
    • Geffner, H.1
  • 39
    • 75549090229 scopus 로고    scopus 로고
    • The free-energy principle: A unified brain theory
    • Friston K. 2010 The free-energy principle: a unified brain theory? Nat. Rev. Neurosci. 11, 127-138. (doi:10. 1038/nrn2787)
    • (2010) Nat. Rev. Neurosci. , vol.11 , pp. 127-138
    • Friston, K.1
  • 40
    • 84861444112 scopus 로고    scopus 로고
    • Encoding goals but not abstract magnitude in the primate prefrontal cortex
    • Genovesio A, Tsujimoto S, Wise SP. 2012 Encoding goals but not abstract magnitude in the primate prefrontal cortex. Neuron 74, 656-662. (doi:10. 1016/j. neuron. 2012. 02. 023)
    • (2012) Neuron , vol.74 , pp. 656-662
    • Genovesio, A.1    Tsujimoto, S.2    Wise, S.P.3
  • 41
    • 84867578045 scopus 로고    scopus 로고
    • Active inference and agency: Optimal control without cost functions
    • Friston K, Samothrakis S, Montague R. 2012 Active inference and agency: optimal control without cost functions. Biol. Cybern. 106, 523-541. (doi:10. 1007/s00422-012-0512-8)
    • (2012) Biol. Cybern. , vol.106 , pp. 523-541
    • Friston, K.1    Samothrakis, S.2    Montague, R.3
  • 42
    • 34547699502 scopus 로고    scopus 로고
    • Functional specialization of the primate frontal cortex during decision making
    • Lee D, Rushworth MFS, Walton ME, Watanabe M, Sakagami M. 2007 Functional specialization of the primate frontal cortex during decision making. J. Neurosci. 27, 8170-8173. (doi:10. 1523/ JNEUROSCI. 1561-07. 2007)
    • (2007) J. Neurosci. , vol.27 , pp. 8170-8173
    • Lee, D.1    Rushworth, M.F.S.2    Walton, M.E.3    Watanabe, M.4    Sakagami, M.5
  • 43
    • 84877280273 scopus 로고    scopus 로고
    • Thermodynamics as a theory of decision-making with informationprocessing costs
    • Ortega PA, Braun DA. 2013 Thermodynamics as a theory of decision-making with informationprocessing costs. Proc. R. Soc. A 469, 20120683. (doi:10. 1098/rspa. 2012. 0683)
    • (2013) Proc. R. Soc. A , vol.469 , pp. 20120683
    • Ortega, P.A.1    Braun, D.A.2
  • 44
    • 0036278801 scopus 로고    scopus 로고
    • Task partitioning in insect societies: Bucket brigades
    • Anderson C, Boomsma JJ, Bartholdi JJ. 2002 Task partitioning in insect societies: bucket brigades. Insectes Soc. 49, 171-180. (doi:10. 1007/s00040-002-8298-7)
    • (2002) Insectes Soc. , vol.49 , pp. 171-180
    • Anderson, C.1    Boomsma, J.J.2    Bartholdi, J.J.3
  • 45
    • 84879719876 scopus 로고    scopus 로고
    • Time delay implies cost on task switching: A model to investigate the efficiency of task partitioning
    • Hamann H, Karsai I, Schmickl T. 2013 Time delay implies cost on task switching: a model to investigate the efficiency of task partitioning. Bull. Math. Biol. 75, 1181-1206. (doi:10. 1007/s11538-013-9851-4)
    • (2013) Bull. Math. Biol. , vol.75 , pp. 1181-1206
    • Hamann, H.1    Karsai, I.2    Schmickl, T.3
  • 46
    • 79959426387 scopus 로고    scopus 로고
    • Regulation of task partitioning by a common stomach: A model of nest construction in social wasps
    • Karsai I, Schmickl T. 2011 Regulation of task partitioning by a common stomach: a model of nest construction in social wasps. Behav. Ecol. 22, 819-830. (doi:10. 1093/beheco/arr060)
    • (2011) Behav. Ecol. , vol.22 , pp. 819-830
    • Karsai, I.1    Schmickl, T.2
  • 47
    • 84855505401 scopus 로고    scopus 로고
    • Stop signals provide cross inhibition in collective decision-making by honeybee swarms
    • Seeley TD, Visscher PK, Schlegel T, Hogan PM, Franks NR, Marshall JAR. 2012 Stop signals provide cross inhibition in collective decision-making by honeybee swarms. Science 335, 108-111. (doi:10. 1126/science. 1210361)
    • (2012) Science , vol.335 , pp. 108-111
    • Seeley, T.D.1    Visscher, P.K.2    Schlegel, T.3    Hogan, P.M.4    Franks, N.R.5    Marshall, J.A.R.6
  • 48
    • 78751572785 scopus 로고    scopus 로고
    • Swarm cognition: An interdisciplinary approach to the study of self-organising biological collectives
    • Trianni V, Tuci E, Passino KM, Marshall JAR. 2011 Swarm cognition: an interdisciplinary approach to the study of self-organising biological collectives. Swarm Intell. 5, 3-18. (doi:10. 1007/s11721-010-0050-8)
    • (2011) Swarm Intell. , vol.5 , pp. 3-18
    • Trianni, V.1    Tuci, E.2    Passino, K.M.3    Marshall, J.A.R.4
  • 49
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw ND, Niv Y, Dayan P. 2005 Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704-1711. (doi:10. 1038/nn1560)
    • (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 50
    • 84857290855 scopus 로고    scopus 로고
    • The value of foresight: How prospection affects decision-making
    • Pezzulo G, Rigoli F. 2011 The value of foresight: how prospection affects decision-making. Front. Neurosci. 5, 79. (doi:10. 3389/fnins. 2011. 00079)
    • (2011) Front. Neurosci. , vol.5 , pp. 79
    • Pezzulo, G.1    Rigoli, F.2
  • 51
    • 84907494705 scopus 로고    scopus 로고
    • The principles of goaldirected decision-making: From neural mechanisms to computation and robotics
    • Pezzulo G, Verschure P, Balkenius C, Pennartz C. 2014 The principles of goaldirected decision-making: from neural mechanisms to computation and robotics. Phil. Trans. R. Soc. B 369, 20130470. (doi:10. 1098/ rstb. 2013. 0470)
    • (2014) Phil. Trans. R. Soc. B , vol.369 , pp. 20130470
    • Pezzulo, G.1    Verschure, P.2    Balkenius, C.3    Pennartz, C.4
  • 52
    • 84927125372 scopus 로고    scopus 로고
    • Internally generated sequences in learning and executing goal-directed behavior
    • Pezzulo G, van der Meer MA, Lansink CS, Pennartz CMA. 2014 Internally generated sequences in learning and executing goal-directed behavior. Trends Cogn. Sci. 18, 647-657. (doi:10. 1016/j. tics. 2014. 06. 011)
    • (2014) Trends Cogn. Sci. , vol.18 , pp. 647-657
    • Pezzulo, G.1    Van Der Meer, M.A.2    Lansink, C.S.3    Pennartz, C.M.A.4
  • 53
    • 84907494746 scopus 로고    scopus 로고
    • The why, what, where, when and how of goal directed choice: Neuronal and computational principles
    • Verschure P, Pennartz C, Pezzulo G. 2014 The why, what, where, when and how of goal directed choice: neuronal and computational principles. Phil. Trans. R. Soc. B 369, 20130483. (doi:10. 1098/rstb. 2013. 0483)
    • (2014) Phil. Trans. R. Soc. B , vol.369 , pp. 20130483
    • Verschure, P.1    Pennartz, C.2    Pezzulo, G.3
  • 54
    • 0002014402 scopus 로고
    • Possible principles underlying the transformation of sensory messages
    • (ed. WA Rosenblith). Cambridge, MA: MIT Press
    • Barlow HB. 1961 Possible principles underlying the transformation of sensory messages. In Sensory communication (ed. WA Rosenblith). Cambridge, MA: MIT Press.
    • (1961) Sensory Communication
    • Barlow, H.B.1
  • 55
    • 57149100752 scopus 로고    scopus 로고
    • A hierarchy of time-scales and the brain
    • Kiebel SJ, Daunizeau J, Friston KJ. 2008 A hierarchy of time-scales and the brain. PLoS Comput. Biol. 4, e1000209. (doi:10. 1371/journal. pcbi. 1000209)
    • (2008) PLoS Comput. Biol. , vol.4 , pp. e1000209
    • Kiebel, S.J.1    Daunizeau, J.2    Friston, K.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.