메뉴 건너뛰기




Volumn 106, Issue 8-9, 2012, Pages 523-541

Active inference and agency: Optimal control without cost functions

Author keywords

Action; Agency; Bayesian; Free energy; Inference; Optimal control; Partially observable Markov decision processes

Indexed keywords

ACTION; AGENCY; BAYESIAN; INFERENCE; OPTIMAL CONTROLS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS;

EID: 84867578045     PISSN: 03401200     EISSN: 14320770     Source Type: Journal    
DOI: 10.1007/s00422-012-0512-8     Document Type: Article
Times cited : (195)

References (80)
  • 1
    • 33845251896 scopus 로고
    • Principles of the self-organizing dynamic system
    • Ashby WR (1947) Principles of the self-organizing dynamic system. J Gen Psychol 37:125-128
    • (1947) J Gen Psychol , vol.37 , pp. 125-128
    • Ashby, W.R.1
  • 3
    • 0013495368 scopus 로고    scopus 로고
    • Experiments with infinite- horizon, policy-gradient estimation
    • Baxter J, Bartlett PL, Weaver L (2001) Experiments with Infinite- Horizon, Policy-Gradient Estimation. J Artif Intell Res 15:351-381
    • (2001) J Artif Intell Res , vol.15 , pp. 351-381
    • Baxter, J.1    Bartlett, P.L.2    Weaver, L.3
  • 5
    • 0008556523 scopus 로고
    • On the theory of dynamic programming
    • Bellman R (1952) On the theory of dynamic programming. Proc Natl Acad Sci USA 38:716-719
    • (1952) Proc Natl Acad Sci USA , vol.38 , pp. 716-719
    • Bellman, R.1
  • 6
    • 2442701355 scopus 로고    scopus 로고
    • Motivation concepts in behavioral neuroscience
    • Berridge KC (2004) Motivation concepts in behavioral neuroscience. Physiol Behav 81(2):179-209
    • (2004) Physiol Behav , vol.81 , Issue.2 , pp. 179-209
    • Berridge, K.C.1
  • 7
    • 0001650497 scopus 로고
    • Proof of the ergodic theorem
    • Birkhoff GD (1931) Proof of the ergodic theorem. Proc Natl Acad Sci USA 17:656-660
    • (1931) Proc Natl Acad Sci USA , vol.17 , pp. 656-660
    • Birkhoff, G.D.1
  • 8
    • 70049104354 scopus 로고    scopus 로고
    • Goal-directed decision making in prefrontal cortex: A computational framework
    • BotvinickMM,AnJ (2008) Goal-directed decision making in prefrontal cortex: a computational framework. Adv Neural Inf Process Syst (NIPS) 21
    • (2008) Adv Neural Inf Process Syst (NIPS) , vol.21
    • Botvinick, M.M.1    An, J.2
  • 10
    • 0001214234 scopus 로고
    • A complete class theorem for statistical problems with finite sample spaces
    • Brown LD (1981) A complete class theorem for statistical problems with finite sample spaces. Ann Stat 9(6):1289-1300
    • (1981) Ann Stat , vol.9 , Issue.6 , pp. 1289-1300
    • Brown, L.D.1
  • 11
    • 0038060580 scopus 로고    scopus 로고
    • Behavioural studies of strategic thinking in games
    • Camerer CF (2003) Behavioural studies of strategic thinking in games. Trends Cogn Sci 7(5):225-231
    • (2003) Trends Cogn Sci , vol.7 , Issue.5 , pp. 225-231
    • Camerer, C.F.1
  • 14
    • 33646492363 scopus 로고    scopus 로고
    • The computational neurobiology of learning and reward
    • Daw ND, Doya K (2006) The computational neurobiology of learning and reward. Curr Opin Neurobiol 16(2):199-204
    • (2006) Curr Opin Neurobiol , vol.16 , Issue.2 , pp. 199-204
    • Daw, N.D.1    Doya, K.2
  • 15
    • 60749114870 scopus 로고    scopus 로고
    • Decision theory, reinforcement learning, and the brain
    • Dayan P, Daw ND (2008) Decision theory, reinforcement learning, and the brain. Cogn Affect Behav Neurosci 8(4):429-453
    • (2008) Cogn Affect Behav Neurosci , vol.8 , Issue.4 , pp. 429-453
    • Dayan, P.1    Daw, N.D.2
  • 16
    • 0346982426 scopus 로고    scopus 로고
    • Using expectation maximization for reinforcement learning
    • Dayan P, Hinton GE (1997) Using expectation maximization for reinforcement learning. Neural Comput 9:271-278
    • (1997) Neural Comput , vol.9 , pp. 271-278
    • Dayan, P.1    Hinton, G.E.2
  • 19
    • 1542347541 scopus 로고    scopus 로고
    • A non-equilibrium free energy theorem for deterministic systems
    • Evans DJ (2003) A non-equilibrium free energy theorem for deterministic systems. Mol Phys 101:15551-15554
    • (2003) Mol Phys , vol.101 , pp. 15551-15554
    • Evans, D.J.1
  • 20
    • 0011716051 scopus 로고
    • Dual control theory, Part i
    • Feldbaum AA (1961) Dual control theory, Part I. Autom Remote Control 21(9):874-880
    • (1961) Autom Remote Control , vol.21 , Issue.9 , pp. 874-880
    • Feldbaum, A.A.1
  • 21
    • 79952575205 scopus 로고    scopus 로고
    • Attention, uncertainty, and free-energy
    • Feldman H, Friston KJ (2010) Attention, uncertainty, and free-energy. Front Hum Neurosci 4:215
    • (2010) Front Hum Neurosci , vol.4 , pp. 215
    • Feldman, H.1    Friston, K.J.2
  • 25
    • 57149113922 scopus 로고    scopus 로고
    • Hierarchical models in the brain
    • Friston K (2008) Hierarchical models in the brain. PLoS Comput Biol 4(11):e1000211
    • (2008) PLoS Comput Biol , vol.4 , Issue.11
    • Friston, K.1
  • 26
    • 75549090229 scopus 로고    scopus 로고
    • The free-energy principle: A unified brain theory?
    • FristonK (2010) The free-energy principle: a unified brain theory?. Nat Rev Neurosci 11(2):127-138
    • (2010) Nat Rev Neurosci , vol.11 , Issue.2 , pp. 127-138
    • Friston, K.1
  • 27
    • 83455164974 scopus 로고    scopus 로고
    • What is optimal about motor control?
    • Friston K (2011) What is optimal about motor control?. Neuron 72(3):488-498
    • (2011) Neuron , vol.72 , Issue.3 , pp. 488-498
    • Friston, K.1
  • 29
    • 70349387424 scopus 로고    scopus 로고
    • Cortical circuits for perceptual inference
    • Friston K, Kiebel S (2009) Cortical circuits for perceptual inference. Neural Netw 22(8):1093-1104
    • (2009) Neural Netw , vol.22 , Issue.8 , pp. 1093-1104
    • Friston, K.1    Kiebel, S.2
  • 30
    • 66149122170 scopus 로고    scopus 로고
    • Predictive coding under the free-energy principle
    • Friston K, Kiebel S (2009) Predictive coding under the free-energy principle. Philos Trans R Soc Lond B Biol Sci 364(1521):1211-1221
    • (2009) Philos Trans R Soc Lond B Biol Sci , vol.364 , Issue.1521 , pp. 1211-1221
    • Friston, K.1    Kiebel, S.2
  • 31
    • 68149131857 scopus 로고    scopus 로고
    • Active inference or reinforcement learning?
    • Friston KJ, Daunizeau J, Kiebel SJ (2009) Active inference or reinforcement learning?. PLoS One 4(7):e6421
    • (2009) PLoS One , vol.4 , Issue.7
    • Friston, K.J.1    Daunizeau, J.2    Kiebel, S.J.3
  • 34
    • 37849187806 scopus 로고    scopus 로고
    • A free energy principle for the brain
    • Friston K, Kilner J, Harrison L (2006) A free energy principle for the brain. J Physiol Paris 100(1-3):70-87
    • (2006) J Physiol Paris , vol.100 , Issue.1-3 , pp. 70-87
    • Friston, K.1    Kilner, J.2    Harrison, L.3
  • 35
    • 79952575636 scopus 로고    scopus 로고
    • Action understanding and active inference
    • Friston K, Mattout J, Kilner J (2011) Action understanding and active inference. Biol Cybern 104:137-160
    • (2011) Biol Cybern , vol.104 , pp. 137-160
    • Friston, K.1    Mattout, J.2    Kilner, J.3
  • 36
    • 0028351678 scopus 로고
    • Value-dependent selection in the brain: Simulation in a synthetic neural model
    • Friston KJ, Tononi G, Reeke GNJ, Sporns O, Edelman GM (1994) Value-dependent selection in the brain: simulation in a synthetic neural model. Neuroscience 59(2):229-243
    • (1994) Neuroscience , vol.59 , Issue.2 , pp. 229-243
    • Friston, K.J.1    Tononi, G.2    Reeke, G.N.J.3    Sporns, O.4    Edelman, G.M.5
  • 38
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Gläscher J, Daw N, Dayan P, O'Doherty JP (2010) States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66(4):585-595
    • (2010) Neuron , vol.66 , Issue.4 , pp. 585-595
    • Gläscher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 39
    • 1142294564 scopus 로고    scopus 로고
    • Learning robust nonlinear control with neuroevolution
    • Department of Computer Sciences, The University of Texas at Austin
    • Gomez F, Miikkulainen R (2001) Learning robust nonlinear control with neuroevolution. Technical Report AI01-292, Department of Computer Sciences, The University of Texas at Austin
    • (2001) Technical Report AI01-292
    • Gomez, F.1    Miikkulainen, R.2
  • 40
    • 44649193889 scopus 로고    scopus 로고
    • Accelerated neural evolution through cooperatively coevolved synapses
    • Gomez F, Schmidhuber J, Miikkulainen R (2009) Accelerated neural evolution through cooperatively coevolved synapses. J Mach Learn Res 9:937-965
    • (2009) J Mach Learn Res , vol.9 , pp. 937-965
    • Gomez, F.1    Schmidhuber, J.2    Miikkulainen, R.3
  • 41
    • 0013348652 scopus 로고
    • Concerning the perceptions in general
    • 3rd edn. Dover, New York
    • Helmholtz H (1866/1962), Concerning the perceptions in general. In: Treatise on physiological optics, 3rd edn. Dover, New York
    • (1866) Treatise on Physiological Optics
    • Helmholtz, H.1
  • 42
    • 0027803368 scopus 로고
    • Keeping neural networks simple by minimizing the description length of weights
    • Hinton GE, van Camp D (1993) Keeping neural networks simple by minimizing the description length of weights. In: Proceedings of COLT-93,pp 5-13
    • (1993) Proceedings of COLT-93 , pp. 5-13
    • Hinton, G.E.1    Van Camp, D.2
  • 45
    • 0034198996 scopus 로고    scopus 로고
    • Observable operator models for discrete stochastic time series
    • Jaeger H (2000) Observable operator models for discrete stochastic time series. Neural Comput 12:1371-1398
    • (2000) Neural Comput , vol.12 , pp. 1371-1398
    • Jaeger, H.1
  • 47
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling LP, Littman ML, Cassandra AR (1998) Planning and acting in partially observable stochastic domains. Artif Intell 101 (1-2):99-134
    • (1998) Artif Intell , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 48
    • 28844435646 scopus 로고    scopus 로고
    • Linear theory for control of nonlinear stochastic systems
    • KappenHJ (2005) Linear theory for control of nonlinear stochastic systems. Phys Rev Lett 95(20):200201
    • (2005) Phys Rev Lett , vol.95 , Issue.20 , pp. 200201
    • Kappen, H.J.1
  • 49
    • 29044440299 scopus 로고    scopus 로고
    • Path integrals and symmetry breaking for optimal control theory
    • Kappen HJ (2005) Path integrals and symmetry breaking for optimal control theory. J Stat Mech: Theory Exp 11:P11011
    • (2005) J Stat Mech: Theory Exp , vol.11
    • Kappen, H.J.1
  • 53
    • 77955962977 scopus 로고    scopus 로고
    • Neuroeconomic approaches to mental disorders
    • Kishida KT, King-Casas B, Montague PR (2010) Neuroeconomic approaches to mental disorders. Neuron 67(4):543-554
    • (2010) Neuron , vol.67 , Issue.4 , pp. 543-554
    • Kishida, K.T.1    King-Casas, B.2    Montague, P.R.3
  • 56
    • 0029272806 scopus 로고
    • Free-energy minimisation algorithm for decoding and cryptoanalysis
    • MacKay DJ (1995) Free-energy minimisation algorithm for decoding and cryptoanalysis. Electron Lett 31:445-447
    • (1995) Electron Lett , vol.31 , pp. 445-447
    • MacKay, D.J.1
  • 57
    • 0028972278 scopus 로고
    • Bee foraging in uncertain environments using predictive Hebbian learning
    • Montague PR,Dayan P, Person C, Sejnowski TJ (1995) Bee foraging in uncertain environments using predictive Hebbian learning. Nature 377(6551):725-728
    • (1995) Nature , vol.377 , Issue.6551 , pp. 725-728
    • Montague, P.R.1    Dayan, P.2    Person, C.3    Sejnowski, T.J.4
  • 59
    • 80055087257 scopus 로고    scopus 로고
    • A neurodynamic account of spontaneous behaviour
    • Namikawa J, Nishimoto R, Tani J (2011) A neurodynamic account of spontaneous behaviour. PLoS Comput Biol. 7(10):e1002221
    • (2011) PLoS Comput Biol. , vol.7 , Issue.10
    • Namikawa, J.1    Nishimoto, R.2    Tani, J.3
  • 60
    • 0002788893 scopus 로고    scopus 로고
    • A view of the em algorithm that justifies incremental sparse and other variants
    • JordanM(ed.) Kluwer Academic, Dordrecht
    • Neal RM, Hinton GE (1998) A view of the EM algorithm that justifies incremental sparse and other variants. In: JordanM(ed.) Learning in graphical models. Kluwer Academic, Dordrecht
    • (1998) Learning in Graphical Models
    • Neal, R.M.1    Hinton, G.E.2
  • 63
    • 79960241771 scopus 로고    scopus 로고
    • Decision making under uncertainty: A neural model based on partially observable markov decision processes
    • Rao RP (2010) Decision making under uncertainty: a neural model based on partially observable markov decision processes. Front Comput Neurosci 4:146
    • (2010) Front Comput Neurosci , vol.4 , pp. 146
    • Rao, R.P.1
  • 64
    • 0033360288 scopus 로고    scopus 로고
    • Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects
    • Rao RP, Ballard DH (1999) Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 2(1):79-87
    • (1999) Nat Neurosci , vol.2 , Issue.1 , pp. 79-87
    • Rao, R.P.1    Ballard, D.H.2
  • 66
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • A Black, W Prokasy (eds.). Appleton Century Crofts, New York
    • Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: A Black, W Prokasy (eds.) Classical conditioning II: current research and theory. Appleton Century Crofts, New York
    • (1972) Classical Conditioning II: Current Research and Theory
    • Rescorla, R.A.1    Wagner, A.R.2
  • 67
    • 0009680239 scopus 로고
    • L'analyse statistique Bayesienne
    • Paris, France
    • RobertC(1992)L'analyse statistique Bayesienne. In: Economica. Paris, France
    • (1992) Economica
    • Robert, C.1
  • 68
    • 0024038570 scopus 로고
    • Probabilistic inference and influence diagrams
    • Shachter RD (1988) Probabilistic inference and influence diagrams. Operat Res 36:589-605
    • (1988) Operat Res , vol.36 , pp. 589-605
    • Shachter, R.D.1
  • 70
    • 0019537951 scopus 로고
    • Toward a modern theory of adaptive networks: Expectation and prediction
    • Sutton RS, Barto AG (1981) Toward a modern theory of adaptive networks: expectation and prediction. Psychol Rev 88(2):135-170
    • (1981) Psychol Rev , vol.88 , Issue.2 , pp. 135-170
    • Sutton, R.S.1    Barto, A.G.2
  • 71
    • 0037258296 scopus 로고    scopus 로고
    • Learning to generate articulated behavior through the bottom-up and the top-down interaction processes
    • Tani J (2003) Learning to generate articulated behavior through the bottom-up and the top-down interaction processes. Neural Netw 16(1):11-23
    • (2003) Neural Netw , vol.16 , Issue.1 , pp. 11-23
    • Tani, J.1
  • 72
    • 79551503171 scopus 로고    scopus 로고
    • A generalized path integral control approach to reinforcement learning
    • Theodorou E, Buchli J, Schaal S (2010) A generalized path integral control approach to reinforcement learning. J Mach Learn Res 11:3137-3181
    • (2010) J Mach Learn Res , vol.11 , pp. 3137-3181
    • Theodorou, E.1    Buchli, J.2    Schaal, S.3
  • 77
    • 52249107868 scopus 로고    scopus 로고
    • Graphical model inference in optimal control of stochastic multi-agent systems
    • Van den Broek B, Wiegerinck W, Kappen B (2008) Graphical model inference in optimal control of stochastic multi-agent systems. J Artif Int Res 32(1):95-122
    • (2008) J Artif Int Res , vol.32 , Issue.1 , pp. 95-122
    • Van Den Broek, B.1    Wiegerinck, W.2    Kappen, B.3
  • 79
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • WilliamsRJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn 8:229-256
    • (1992) Mach Learn , vol.8 , pp. 229-256
    • Williams, R.J.1
  • 80
    • 0032210433 scopus 로고    scopus 로고
    • Probabilistic inference in influence diagrams
    • Zhang NL (1998) Probabilistic inference in influence diagrams. Comput Intell 14(4):475-497
    • (1998) Comput Intell , vol.14 , Issue.4 , pp. 475-497
    • Zhang, N.L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.