SCOPUS 정보 검색 플랫폼

Biological Cybernetics

Volumn 106, Issue 8-9, 2012, Pages 523-541

Active inference and agency: Optimal control without cost functions

(3) Friston, Karl a Samothrakis, Spyridon b Montague, Read c

a UNIVERSITY COLLEGE LONDON (United Kingdom)

b UNIVERSITY OF ESSEX (United Kingdom)

c VIRGINIA TECH CARILION SCHOOL OF MEDICINE (United States)

Author keywords

Action; Agency; Bayesian; Free energy; Inference; Optimal control; Partially observable Markov decision processes

Indexed keywords

ACTION; AGENCY; BAYESIAN; INFERENCE; OPTIMAL CONTROLS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS;

COST FUNCTIONS; FREE ENERGY;

CONTROL;

ALGORITHM; ARTICLE; BAYES THEOREM; BIOLOGICAL MODEL; DECISION MAKING; PHYSIOLOGY; PROBABILITY;

ALGORITHMS; BAYES THEOREM; DECISION MAKING; MARKOV CHAINS; MODELS, NEUROLOGICAL;

EID: 84867578045 PISSN: 03401200 EISSN: 14320770 Source Type: Journal
DOI: 10.1007/s00422-012-0512-8 Document Type: Article

Times cited : (195)

References (80)

1
- 33845251896
- Principles of the self-organizing dynamic system
- Ashby WR (1947) Principles of the self-organizing dynamic system. J Gen Psychol 37:125-128
- (1947) J Gen Psychol , vol.37 , pp. 125-128
- Ashby, W.R.¹

2
- 77649259491
- Cross-frequency coupling supports multi-itemworking memory in the human hippocampus
- Axmacher N, Henseler MM, Jensen O, Weinreich I, Elger CE, Fell J (2010) Cross-frequency coupling supports multi-itemworking memory in the human hippocampus. Proc Natl Acad Sci 107(7):3228-3233
- (2010) Proc Natl Acad Sci , vol.107 , Issue.7 , pp. 3228-3233
- Axmacher, N.¹ Henseler, M.M.² Jensen, O.³ Weinreich, I.⁴ Elger, C.E.⁵ Fell, J.⁶

3
- 0013495368
- Experiments with infinite- horizon, policy-gradient estimation
- Baxter J, Bartlett PL, Weaver L (2001) Experiments with Infinite- Horizon, Policy-Gradient Estimation. J Artif Intell Res 15:351-381
- (2001) J Artif Intell Res , vol.15 , pp. 351-381
- Baxter, J.¹ Bartlett, P.L.² Weaver, L.³

4
- 3543081155
- PhD. Thesis, University College London, London
- Beal MJ (2003) Variational algorithms for approximate bayesian inference'. PhD. Thesis, University College London, London
- (2003) Variational Algorithms for Approximate Bayesian Inference
- Beal, M.J.¹

5
- 0008556523
- On the theory of dynamic programming
- Bellman R (1952) On the theory of dynamic programming. Proc Natl Acad Sci USA 38:716-719
- (1952) Proc Natl Acad Sci USA , vol.38 , pp. 716-719
- Bellman, R.¹

6
- 2442701355
- Motivation concepts in behavioral neuroscience
- Berridge KC (2004) Motivation concepts in behavioral neuroscience. Physiol Behav 81(2):179-209
- (2004) Physiol Behav , vol.81 , Issue.2 , pp. 179-209
- Berridge, K.C.¹

7
- 0001650497
- Proof of the ergodic theorem
- Birkhoff GD (1931) Proof of the ergodic theorem. Proc Natl Acad Sci USA 17:656-660
- (1931) Proc Natl Acad Sci USA , vol.17 , pp. 656-660
- Birkhoff, G.D.¹

8
- 70049104354
- Goal-directed decision making in prefrontal cortex: A computational framework
- BotvinickMM,AnJ (2008) Goal-directed decision making in prefrontal cortex: a computational framework. Adv Neural Inf Process Syst (NIPS) 21
- (2008) Adv Neural Inf Process Syst (NIPS) , vol.21
- Botvinick, M.M.¹ An, J.²

9
- 80052231358
- Path integral control and bounded rationality
- Paris
- BraunDA, Ortega P, Theodorou E, Schaal S (2011) Path integral control and bounded rationality. In: ADPRL 2011, Paris
- (2011) ADPRL 2011
- Braunda Ortega, P.¹ Theodorou, E.² Schaal, S.³

10
- 0001214234
- A complete class theorem for statistical problems with finite sample spaces
- Brown LD (1981) A complete class theorem for statistical problems with finite sample spaces. Ann Stat 9(6):1289-1300
- (1981) Ann Stat , vol.9 , Issue.6 , pp. 1289-1300
- Brown, L.D.¹

11
- 0038060580
- Behavioural studies of strategic thinking in games
- Camerer CF (2003) Behavioural studies of strategic thinking in games. Trends Cogn Sci 7(5):225-231
- (2003) Trends Cogn Sci , vol.7 , Issue.5 , pp. 225-231
- Camerer, C.F.¹

12
- 33748808979
- High gamma power is phase-locked to theta oscillations in human neocortex
- Canolty RT, Edwards E, Dalal SS, SoltaniM,Nagarajan SS,KirschHE, Berger MS, Barbaro NM, Knight R (2006) High gamma power is phase-locked to theta oscillations in human neocortex. Science 313(5793):1626-1628
- (2006) Science , vol.313 , Issue.5793 , pp. 1626-1628
- Canolty, R.T.¹ Edwards, E.² Dalal, S.S.³ Soltani, M.⁴ Nagarajan, S.S.⁵ Kirsch, H.E.⁶ Berger, M.S.⁷ Barbaro, N.M.⁸ Knight, R.⁹

13
- 0008586604
- A method for using belief networks as influence diagrams
- Cooper G (1988) A method for using belief networks as influence diagrams. In: Proceedings of the Conference on uncertainty in artificial intelligence
- (1988) Proceedings of the Conference on Uncertainty in Artificial Intelligence
- Cooper, G.¹

14
- 33646492363
- The computational neurobiology of learning and reward
- Daw ND, Doya K (2006) The computational neurobiology of learning and reward. Curr Opin Neurobiol 16(2):199-204
- (2006) Curr Opin Neurobiol , vol.16 , Issue.2 , pp. 199-204
- Daw, N.D.¹ Doya, K.²

15
- 60749114870
- Decision theory, reinforcement learning, and the brain
- Dayan P, Daw ND (2008) Decision theory, reinforcement learning, and the brain. Cogn Affect Behav Neurosci 8(4):429-453
- (2008) Cogn Affect Behav Neurosci , vol.8 , Issue.4 , pp. 429-453
- Dayan, P.¹ Daw, N.D.²

16
- 0346982426
- Using expectation maximization for reinforcement learning
- Dayan P, Hinton GE (1997) Using expectation maximization for reinforcement learning. Neural Comput 9:271-278
- (1997) Neural Comput , vol.9 , pp. 271-278
- Dayan, P.¹ Hinton, G.E.²

17
- 0029372831
- The Helmholtz machine
- Dayan P, Hinton GE, Neal R (1995) The Helmholtz machine. Neural Comput 7:889-904
- (1995) Neural Comput , vol.7 , pp. 889-904
- Dayan, P.¹ Hinton, G.E.² Neal, R.³

18
- 1942450858
- PhD thesis. University of Massachusetts, Amherst
- Duff M, (2002) Optimal learning: computational procedure for bayes- adaptive markov decision processes. PhD thesis. University of Massachusetts, Amherst
- (2002) Optimal Learning: Computational Procedure for Bayes- Adaptive Markov Decision Processes
- Duff, M.¹

19
- 1542347541
- A non-equilibrium free energy theorem for deterministic systems
- Evans DJ (2003) A non-equilibrium free energy theorem for deterministic systems. Mol Phys 101:15551-15554
- (2003) Mol Phys , vol.101 , pp. 15551-15554
- Evans, D.J.¹

20
- 0011716051
- Dual control theory, Part i
- Feldbaum AA (1961) Dual control theory, Part I. Autom Remote Control 21(9):874-880
- (1961) Autom Remote Control , vol.21 , Issue.9 , pp. 874-880
- Feldbaum, A.A.¹

21
- 79952575205
- Attention, uncertainty, and free-energy
- Feldman H, Friston KJ (2010) Attention, uncertainty, and free-energy. Front Hum Neurosci 4:215
- (2010) Front Hum Neurosci , vol.4 , pp. 215
- Feldman, H.¹ Friston, K.J.²

22
- 0003575440
- Benjamin, Reading M.A
- Feynman RP (1972) Statistical mechanics. Benjamin, Reading M.A.
- (1972) Statistical Mechanics
- Feynman, R.P.¹

23
- 33644956365
- Springer, Berlin
- Filatov N, Unbehauen H (2004) Adaptive dual control: theory and applications (lecture notes in control and information sciences. Springer, Berlin
- (2004) Adaptive Dual Control: Theory and Applications (Lecture Notes in Control and Information Sciences
- Filatov, N.¹ Unbehauen, H.²

24
- 84867582211
- A tutorial on variational Bayes
- Spinger, Berlin
- Fox C, Roberts S (2011) A tutorial on variational Bayes. In: Artificial intelligence review. Spinger, Berlin
- (2011) Artificial Intelligence Review
- Fox, C.¹ Roberts, S.²

25
- 57149113922
- Hierarchical models in the brain
- Friston K (2008) Hierarchical models in the brain. PLoS Comput Biol 4(11):e1000211
- (2008) PLoS Comput Biol , vol.4 , Issue.11
- Friston, K.¹

26
- 75549090229
- The free-energy principle: A unified brain theory?
- FristonK (2010) The free-energy principle: a unified brain theory?. Nat Rev Neurosci 11(2):127-138
- (2010) Nat Rev Neurosci , vol.11 , Issue.2 , pp. 127-138
- Friston, K.¹

27
- 83455164974
- What is optimal about motor control?
- Friston K (2011) What is optimal about motor control?. Neuron 72(3):488-498
- (2011) Neuron , vol.72 , Issue.3 , pp. 488-498
- Friston, K.¹

28
- 84855582937
- Free-2012energy, value and attractors
- Friston K, Ao P (2012) Free-energy, value and attractors. In: Computational and mathematical methods in medicine, vol 2012
- (2012) Computational and Mathematical Methods in Medicine , vol.2012
- Friston, K.¹ Ao, P.²

29
- 70349387424
- Cortical circuits for perceptual inference
- Friston K, Kiebel S (2009) Cortical circuits for perceptual inference. Neural Netw 22(8):1093-1104
- (2009) Neural Netw , vol.22 , Issue.8 , pp. 1093-1104
- Friston, K.¹ Kiebel, S.²

30
- 66149122170
- Predictive coding under the free-energy principle
- Friston K, Kiebel S (2009) Predictive coding under the free-energy principle. Philos Trans R Soc Lond B Biol Sci 364(1521):1211-1221
- (2009) Philos Trans R Soc Lond B Biol Sci , vol.364 , Issue.1521 , pp. 1211-1221
- Friston, K.¹ Kiebel, S.²

31
- 68149131857
- Active inference or reinforcement learning?
- Friston KJ, Daunizeau J, Kiebel SJ (2009) Active inference or reinforcement learning?. PLoS One 4(7):e6421
- (2009) PLoS One , vol.4 , Issue.7
- Friston, K.J.¹ Daunizeau, J.² Kiebel, S.J.³

32
- 77952091673
- Action and behavior: A free-energy formulation
- Friston KJ, Daunizeau J, Kilner J, Kiebel SJ (2010) Action and behavior: a free-energy formulation. Biol Cybern 102(3): 227-260
- (2010) Biol Cybern , vol.102 , Issue.3 , pp. 227-260
- Friston, K.J.¹ Daunizeau, J.² Kilner, J.³ Kiebel, S.J.⁴

33
- 84857460946
- Dopamine, affordance and active inference
- Friston KJST, Fitzgerald T, Galea JM, Adams R, Brown H, Dolan RJ, Moran R, Stephan KE, Bestmann S (2012) Dopamine, affordance and active inference. PLoS Comput Biol 8(1):e1002327
- (2012) PLoS Comput Biol , vol.8 , Issue.1
- Friston, K.J.S.T.¹ Fitzgerald, T.² Galea, J.M.³ Adams, R.⁴ Brown, H.⁵ Dolan, R.J.⁶ Moran, R.⁷ Stephan, K.E.⁸ Bestmann, S.⁹

34
- 37849187806
- A free energy principle for the brain
- Friston K, Kilner J, Harrison L (2006) A free energy principle for the brain. J Physiol Paris 100(1-3):70-87
- (2006) J Physiol Paris , vol.100 , Issue.1-3 , pp. 70-87
- Friston, K.¹ Kilner, J.² Harrison, L.³

35
- 79952575636
- Action understanding and active inference
- Friston K, Mattout J, Kilner J (2011) Action understanding and active inference. Biol Cybern 104:137-160
- (2011) Biol Cybern , vol.104 , pp. 137-160
- Friston, K.¹ Mattout, J.² Kilner, J.³

36
- 0028351678
- Value-dependent selection in the brain: Simulation in a synthetic neural model
- Friston KJ, Tononi G, Reeke GNJ, Sporns O, Edelman GM (1994) Value-dependent selection in the brain: simulation in a synthetic neural model. Neuroscience 59(2):229-243
- (1994) Neuroscience , vol.59 , Issue.2 , pp. 229-243
- Friston, K.J.¹ Tononi, G.² Reeke, G.N.J.³ Sporns, O.⁴ Edelman, G.M.⁵

37
- 78650969684
- Heuristic decision making
- Gigerenzer G, Gaissmaier W (2011) Heuristic decision making. Annu Rev Psychol 62:451-482
- (2011) Annu Rev Psychol , vol.62 , pp. 451-482
- Gigerenzer, G.¹ Gaissmaier, W.²

38
- 77953260848
- States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
- Gläscher J, Daw N, Dayan P, O'Doherty JP (2010) States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66(4):585-595
- (2010) Neuron , vol.66 , Issue.4 , pp. 585-595
- Gläscher, J.¹ Daw, N.² Dayan, P.³ O'Doherty, J.P.⁴

39
- 1142294564
- Learning robust nonlinear control with neuroevolution
- Department of Computer Sciences, The University of Texas at Austin
- Gomez F, Miikkulainen R (2001) Learning robust nonlinear control with neuroevolution. Technical Report AI01-292, Department of Computer Sciences, The University of Texas at Austin
- (2001) Technical Report AI01-292
- Gomez, F.¹ Miikkulainen, R.²

40
- 44649193889
- Accelerated neural evolution through cooperatively coevolved synapses
- Gomez F, Schmidhuber J, Miikkulainen R (2009) Accelerated neural evolution through cooperatively coevolved synapses. J Mach Learn Res 9:937-965
- (2009) J Mach Learn Res , vol.9 , pp. 937-965
- Gomez, F.¹ Schmidhuber, J.² Miikkulainen, R.³

41
- 0013348652
- Concerning the perceptions in general
- 3rd edn. Dover, New York
- Helmholtz H (1866/1962), Concerning the perceptions in general. In: Treatise on physiological optics, 3rd edn. Dover, New York
- (1866) Treatise on Physiological Optics
- Helmholtz, H.¹

42
- 0027803368
- Keeping neural networks simple by minimizing the description length of weights
- Hinton GE, van Camp D (1993) Keeping neural networks simple by minimizing the description length of weights. In: Proceedings of COLT-93,pp 5-13
- (1993) Proceedings of COLT-93 , pp. 5-13
- Hinton, G.E.¹ Van Camp, D.²

43
- 84867572211
- An expectation maximization algorithm for continuous markov decision processes with arbitrary rewards
- Hoffman, M, de Freitas, N, Doucet, A, Peters J (2009) An expectation maximization algorithm for continuous markov decision processes with arbitrary rewards. In: Twelfth Int. Conf. on artificial intelligence and statistics (AISTATS 2009)
- (2009) Twelfth Int. Conf. on Artificial Intelligence and Statistics (AISTATS 2009)
- Hoffman, M.¹ De Freitas, N.² Doucet, A.³ Peters, J.⁴

44
- 0003644124
- MIT Press Cambridge, M.A
- HowardRA(1960) Dynamic programming andMarkov processes. MIT Press Cambridge, M.A.
- (1960) Dynamic Programming AndMarkov Processes
- Howard, R.A.¹

45
- 0034198996
- Observable operator models for discrete stochastic time series
- Jaeger H (2000) Observable operator models for discrete stochastic time series. Neural Comput 12:1371-1398
- (2000) Neural Comput , vol.12 , pp. 1371-1398
- Jaeger, H.¹

46
- 0000305280
- From influence diagrams to junction trees
- Morgan Kaufmann, San Fransisco
- Jensen F, Jensen V, Dittmer SL (1994) From influence diagrams to junction trees. In: Proc. of the Tenth Conference on uncertainty in artificial intelligence. Morgan Kaufmann, San Fransisco
- (1994) Proc. of the Tenth Conference on Uncertainty in Artificial Intelligence
- Jensen, F.¹ Jensen, V.² Dittmer, S.L.³

47
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling LP, Littman ML, Cassandra AR (1998) Planning and acting in partially observable stochastic domains. Artif Intell 101 (1-2):99-134
- (1998) Artif Intell , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

48
- 28844435646
- Linear theory for control of nonlinear stochastic systems
- KappenHJ (2005) Linear theory for control of nonlinear stochastic systems. Phys Rev Lett 95(20):200201
- (2005) Phys Rev Lett , vol.95 , Issue.20 , pp. 200201
- Kappen, H.J.¹

49
- 29044440299
- Path integrals and symmetry breaking for optimal control theory
- Kappen HJ (2005) Path integrals and symmetry breaking for optimal control theory. J Stat Mech: Theory Exp 11:P11011
- (2005) J Stat Mech: Theory Exp , vol.11
- Kappen, H.J.¹

50
- 84862024986
- arXiv:0901.0633v2
- Kappen HJ, Gomez Y, Opper M (2009) Optimal control as a graphical model inference problem. arXiv:0901.0633v2
- (2009) Optimal Control As A Graphical Model Inference Problem
- Kappen, H.J.¹ Gomez, Y.² Opper, M.³

51
- 77954203511
- Perception and hierarchical dynamics
- Kiebel SJ, Daunizeau J, Friston KJ (2009a) Perception and hierarchical dynamics. Front Neuroinf 3:20
- (2009) Front Neuroinf , vol.3 , pp. 20
- Kiebel, S.J.¹ Daunizeau, J.² Friston, K.J.³

52
- 70049088829
- Recognizing sequences of sequences
- Kiebel SJ, von Kriegstein K, Daunizeau J, Friston KJ (2009b) Recognizing sequences of sequences. PLoS Comput Biol 5(8):e1000464
- (2009) PLoS Comput Biol , vol.5 , Issue.8
- Kiebel, S.J.¹ Von Kriegstein, K.² Daunizeau, J.³ Friston, K.J.⁴

53
- 77955962977
- Neuroeconomic approaches to mental disorders
- Kishida KT, King-Casas B, Montague PR (2010) Neuroeconomic approaches to mental disorders. Neuron 67(4):543-554
- (2010) Neuron , vol.67 , Issue.4 , pp. 543-554
- Kishida, K.T.¹ King-Casas, B.² Montague, P.R.³

54
- 0035478853
- Stochastic boolean satisfiability
- LittmanML, Majercik SM, Pitassi T (2001) Stochastic boolean satisfiability. J Autom Reason 27(3):251-296
- (2001) J Autom Reason , vol.27 , Issue.3 , pp. 251-296
- Littman, M.L.¹ Majercik, S.M.² Pitassi, T.³

55
- 84898982129
- Predictive representations of state
- Littman ML, Sutton RS, Singh S (2002) Predictive Representations of State. Adv Neural Inf Process Syst 14
- (2002) Adv Neural Inf Process Syst , vol.14
- Littman, M.L.¹ Sutton, R.S.² Singh, S.³

56
- 0029272806
- Free-energy minimisation algorithm for decoding and cryptoanalysis
- MacKay DJ (1995) Free-energy minimisation algorithm for decoding and cryptoanalysis. Electron Lett 31:445-447
- (1995) Electron Lett , vol.31 , pp. 445-447
- MacKay, D.J.¹

57
- 0028972278
- Bee foraging in uncertain environments using predictive Hebbian learning
- Montague PR,Dayan P, Person C, Sejnowski TJ (1995) Bee foraging in uncertain environments using predictive Hebbian learning. Nature 377(6551):725-728
- (1995) Nature , vol.377 , Issue.6551 , pp. 725-728
- Montague, P.R.¹ Dayan, P.² Person, C.³ Sejnowski, T.J.⁴

58
- 84867573167
- Bayesian modelling of Jumping-to-conclusions bias in delusional patients
- Moutoussis M, Bentall RP, El-Deredy W, Dayan P (2011) Bayesian modelling of Jumping-to-conclusions bias in delusional patients. Cogn Neuropsychiatry 7:1-26
- (2011) Cogn Neuropsychiatry , vol.7 , pp. 1-26
- Moutoussis, M.¹ Bentall, R.P.² El-Deredy, W.³ Dayan, P.⁴

59
- 80055087257
- A neurodynamic account of spontaneous behaviour
- Namikawa J, Nishimoto R, Tani J (2011) A neurodynamic account of spontaneous behaviour. PLoS Comput Biol. 7(10):e1002221
- (2011) PLoS Comput Biol. , vol.7 , Issue.10
- Namikawa, J.¹ Nishimoto, R.² Tani, J.³

60
- 0002788893
- A view of the em algorithm that justifies incremental sparse and other variants
- JordanM(ed.) Kluwer Academic, Dordrecht
- Neal RM, Hinton GE (1998) A view of the EM algorithm that justifies incremental sparse and other variants. In: JordanM(ed.) Learning in graphical models. Kluwer Academic, Dordrecht
- (1998) Learning in Graphical Models
- Neal, R.M.¹ Hinton, G.E.²

61
- 33750281834
- Best-response play in partially observable card games
- Oliehoek F, Spaan MTJ, Vlassis N (2005) Best-response play in partially observable card games. In: Proceedings of the 14th Annual Machine Learning Conference of Belgium and the Netherlands
- (2005) Proceedings of the 14th Annual Machine Learning Conference of Belgium and the Netherlands
- Oliehoek, F.¹ Spaan, M.T.J.² Vlassis, N.³

62
- 0003391330
- Morgan Kaufmann, San Fransisco
- Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann, San Fransisco
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
- Pearl, J.¹

63
- 79960241771
- Decision making under uncertainty: A neural model based on partially observable markov decision processes
- Rao RP (2010) Decision making under uncertainty: a neural model based on partially observable markov decision processes. Front Comput Neurosci 4:146
- (2010) Front Comput Neurosci , vol.4 , pp. 146
- Rao, R.P.¹

64
- 0033360288
- Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects
- Rao RP, Ballard DH (1999) Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 2(1):79-87
- (1999) Nat Neurosci , vol.2 , Issue.1 , pp. 79-87
- Rao, R.P.¹ Ballard, D.H.²

65
- 84867583023
- arXiv: 1009.3958
- Rawlik K, Toussaint M, Vijayakumar S (2010) Approximate inference and stochastic optimal control. arXiv:1009.3958
- (2010) Approximate Inference and Stochastic Optimal Control
- Rawlik, K.¹ Toussaint, M.² Vijayakumar, S.³

66
- 0002109138
- A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
- A Black, W Prokasy (eds.). Appleton Century Crofts, New York
- Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: A Black, W Prokasy (eds.) Classical conditioning II: current research and theory. Appleton Century Crofts, New York
- (1972) Classical Conditioning II: Current Research and Theory
- Rescorla, R.A.¹ Wagner, A.R.²

67
- 0009680239
- L'analyse statistique Bayesienne
- Paris, France
- RobertC(1992)L'analyse statistique Bayesienne. In: Economica. Paris, France
- (1992) Economica
- Robert, C.¹

68
- 0024038570
- Probabilistic inference and influence diagrams
- Shachter RD (1988) Probabilistic inference and influence diagrams. Operat Res 36:589-605
- (1988) Operat Res , vol.36 , pp. 589-605
- Shachter, R.D.¹

69
- 85161963598
- Monte-Carlo planning in large POMDPs
- Silver D, Veness J (2010) Monte-Carlo planning in large POMDPs. In: Proceedings of the Conference on neural information processing systems
- (2010) Proceedings of the Conference on Neural Information Processing Systems
- Silver, D.¹ Veness, J.²

70
- 0019537951
- Toward a modern theory of adaptive networks: Expectation and prediction
- Sutton RS, Barto AG (1981) Toward a modern theory of adaptive networks: expectation and prediction. Psychol Rev 88(2):135-170
- (1981) Psychol Rev , vol.88 , Issue.2 , pp. 135-170
- Sutton, R.S.¹ Barto, A.G.²

71
- 0037258296
- Learning to generate articulated behavior through the bottom-up and the top-down interaction processes
- Tani J (2003) Learning to generate articulated behavior through the bottom-up and the top-down interaction processes. Neural Netw 16(1):11-23
- (2003) Neural Netw , vol.16 , Issue.1 , pp. 11-23
- Tani, J.¹

72
- 79551503171
- A generalized path integral control approach to reinforcement learning
- Theodorou E, Buchli J, Schaal S (2010) A generalized path integral control approach to reinforcement learning. J Mach Learn Res 11:3137-3181
- (2010) J Mach Learn Res , vol.11 , pp. 3137-3181
- Theodorou, E.¹ Buchli, J.² Schaal, S.³

73
- 84923382376
- Linearly-solvable Markov decision problems
- MIT Press, Boston
- Todorov E (2006) Linearly-solvable Markov decision problems. In: Advances in neural information processing systems. MIT Press, Boston
- (2006) Advances in Neural Information Processing Systems
- Todorov, E.¹

74
- 62949148891
- General duality between optimal control and estimation
- Todorov E (2008) General duality between optimal control and estimation. In: IEEE Conference on decision and control
- (2008) IEEE Conference on Decision and Control
- Todorov, E.¹

75
- 67349102783
- Hierarchical POMDP controller optimization by likelihood maximization
- AUAI Press, Menlo Park
- Toussaint M, Charlin L, Poupart P (2008) Hierarchical POMDP controller optimization by likelihood maximization. In: Uncertainty in artificial intelligence (UAI 2008), AUAI Press, Menlo Park
- (2008) Uncertainty in Artificial Intelligence (UAI 2008)
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

76
- 34250728061
- Probabilistic inference for solving discrete and continuous stateMarkov decision processes
- Toussaint M, Storkey A (2006) Probabilistic inference for solving discrete and continuous stateMarkov decision processes. In: Proceedings of the 23nd International Conference on machine learning
- (2006) Proceedings of the 23nd International Conference on Machine Learning
- Toussaint, M.¹ Storkey, A.²

77
- 52249107868
- Graphical model inference in optimal control of stochastic multi-agent systems
- Van den Broek B, Wiegerinck W, Kappen B (2008) Graphical model inference in optimal control of stochastic multi-agent systems. J Artif Int Res 32(1):95-122
- (2008) J Artif Int Res , vol.32 , Issue.1 , pp. 95-122
- Van Den Broek, B.¹ Wiegerinck, W.² Kappen, B.³

78
- 34249833101
- Q-learning
- Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8:279-292
- (1992) Mach Learn , vol.8 , pp. 279-292
- Watkins, C.J.¹ Dayan, P.²

79
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- WilliamsRJ (1992) Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn 8:229-256
- (1992) Mach Learn , vol.8 , pp. 229-256
- Williams, R.J.¹

80
- 0032210433
- Probabilistic inference in influence diagrams
- Zhang NL (1998) Probabilistic inference in influence diagrams. Comput Intell 14(4):475-497
- (1998) Comput Intell , vol.14 , Issue.4 , pp. 475-497
- Zhang, N.L.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.