SCOPUS 정보 검색 플랫폼

Volumn 87, Issue 2, 2012, Pages 159-182

Optimal control as a graphical model inference problem

(3) Kappen, Hilbert J a Gómez, Vicenç a Opper, Manfred b

b TECHNISCHE UNIVERSITÄT BERLIN (Germany)

Author keywords

Approximate inference; Belief propagation; Cluster variation method; Graphical model; Kullback leibler divergence; Optimal control; Uncontrolled dynamics

Indexed keywords

APPROXIMATE INFERENCE; BELIEF PROPAGATION; CLUSTER VARIATION METHODS; GRAPHICAL MODEL; KULLBACK LEIBLER DIVERGENCE; OPTIMAL CONTROLS; UNCONTROLLED DYNAMICS;

GRAPHIC METHODS; OPTIMAL CONTROL SYSTEMS;

CONTROL;

EID: 84862024986 PISSN: 08856125 EISSN: 15730565 Source Type: Journal
DOI: 10.1007/s10994-012-5278-7 Document Type: Article

Times cited : (344)

References (40)

1
- 35548937764
- Haplotype inference in general pedigrees using the cluster variation method
- Albers, C. A., Heskes, T., & Kappen, H. J. (2007). Haplotype inference in general pedigrees using the cluster variation method. Genetics, 177(2), 1101-1118.
- (2007) Genetics , vol.177 , Issue.2 , pp. 1101-1118
- Albers, C.A.¹ Heskes, T.² Kappen, H.J.³

2
- 33947331887
- The cluster variation method for efficient linkage analysis on extended pedigrees
- Albers, C. A., Leisink, M. A. R., & Kappen, H. J. (2006). The cluster variation method for efficient linkage analysis on extended pedigrees. BMC Bioinformatics, 7 (S-1).
- (2006) BMC Bioinformatics , vol.7 , Issue.S-1
- Albers, C.A.¹ Leisink, M.A.R.² Kappen, H.J.³

3
- 84858765598
- Covariant policy search
- San Francisco: Morgan Kaufmann
- Bagnell, J. A., & Schneider, J. (2003). Covariant policy search. In IJCAI'03: Proceedings of the 18th international joint conference on artificial intelligence (pp. 1019-1024). San Francisco: Morgan Kaufmann.
- (2003) IJCAI'03: Proceedings of the 18th International Joint Conference on Artificial Intelligence , pp. 1019-1024
- Bagnell, J.A.¹ Schneider, J.²

4
- 0003487482
- Belmont: Athena Scientific
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-dynamic programming. Belmont: Athena Scientific.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 84902006506
- Bierkens, J., & Kappen, B. (2012). Kl-learning: Online solution of Kullback-Leibler control problems. http://arxiv.org/abs/1112.1996.
- (2012) Kl-learning: Online Solution of Kullback-Leibler Control Problems
- Bierkens, J.¹ Kappen, B.²

6
- 85166207010
- Exploiting structure in policy construction
- San Francisco: Morgan Kaufmann
- Boutilier, C., Dearden, R., & Goldszmidt, M. (1995). Exploiting structure in policy construction. In IJCAI'95: Proceedings of the 14th international joint conference on artificial intelligence (pp. 1104-1111). San Francisco: Morgan Kaufmann.
- (1995) IJCAI'95: Proceedings of the 14th International Joint Conference on Artificial Intelligence , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

7
- 0008586604
- A method for using belief networks as influence diagrams
- Cooper, G. (1988). A method for using belief networks as influence diagrams. In Proceedings of the workshop on uncertainty in artificial intelligence (UAI'88) (pp. 55-63).
- (1988) Proceedings of the Workshop on Uncertainty in Artificial Intelligence (UAI'88) , pp. 55-63
- Cooper, G.¹

8
- 70349666986
- Linear Bellman combination for control of character animation
- da Silva, M., Durand, F., & Popović, J. (2009). Linear Bellman combination for control of character animation. ACM Transactions on Graphics, 28(3), 82:1-82:10.
- (2009) ACM Transactions on Graphics , vol.28 , Issue.3 , pp. 821-8210
- Da Silva, M.¹ Durand, F.² Popović, J.³

9
- 0346982426
- Using expectation-maximization for reinforcement learning
- Dayan, P., & Hinton, G. E. (1997). Using expectation-maximization for reinforcement learning. Neural Computation, 9(2), 271-278. (Pubitemid 127635391)
- (1997) Neural Computation , vol.9 , Issue.2 , pp. 271-278
- Dayan, P.¹ Hinton, G.E.²

10
- 68149131857
- Reinforcement learning or active inference?
- Friston, K. J., Daunizeau, J., & Kiebel, S. J. (2009). Reinforcement learning or active inference? PLoS ONE, 4(7), e6421.
- (2009) PLoS ONE , vol.4 , Issue.7
- Friston, K.J.¹ Daunizeau, J.² Kiebel, S.J.³

11
- 6344271885
- Approximate inference and constrained optimization
- San Francisco: Morgan Kaufmann
- Heskes, T., Albers, K., & Kappen, H. J. (2003). Approximate inference and constrained optimization. In Proceedings of the 19th conference on uncertainty in artificial intelligence (UAI'03), Acapulco, Mexico, (pp. 313-320). San Francisco: Morgan Kaufmann.
- (2003) Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence (UAI'03), Acapulco, Mexico , pp. 313-320
- Heskes, T.¹ Albers, K.² Kappen, H.J.³

12
- 0004283231
- Cambridge: MIT Press
- Jordan, M. I. (Ed.) (1999). Learning in graphical models. Cambridge: MIT Press.
- (1999) Learning in Graphical Models
- Jordan, M.I.¹

13
- 28844435646
- Linear theory for control of nonlinear stochastic systems
- Kappen, H. J. (2005). Linear theory for control of nonlinear stochastic systems. Physical Review Letters, 95(20), 200201.
- (2005) Physical Review Letters , vol.95 , Issue.20 , pp. 200201
- Kappen, H.J.¹

14
- 84898961717
- Novel iteration schemes for the cluster variation method
- Cambridge: MIT Press
- Kappen, H. J., & Wiegerinck, W. (2002). Novel iteration schemes for the cluster variation method. In Advances in neural information processing systems (Vol. 14, pp. 415-422). Cambridge: MIT Press.
- (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 415-422
- Kappen, H.J.¹ Wiegerinck, W.²

15
- 78049390740
- Policy search for motor primitives in robotics
- Kober, J., & Peters, J. (2011). Policy search for motor primitives in robotics. Machine Learning, 84(1-2), 171-203.
- (2011) Machine Learning , vol.84 , Issue.1-2 , pp. 171-203
- Kober, J.¹ Peters, J.²

16
- 84880688552
- Computing factored value functions for policies in structured mdps
- San Francisco: Morgan Kaufmann
- Koller, D., & Parr, R. (1999). Computing factored value functions for policies in structured mdps. In IJCAI'99: Proceedings of the 16th international joint conference on artificial intelligence (pp. 1332-1339). San Francisco: Morgan Kaufmann.
- (1999) IJCAI'99: Proceedings of the 16th International Joint Conference on Artificial Intelligence , pp. 1332-1339
- Koller, D.¹ Parr, R.²

17
- 0001006209
- Local computations with probabilities on graphical structures and their application to expert systems
- Lauritzen, S. L., & Spiegelhalter, D. J. (1988). Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society. Series B. Methodological, 50(2), 154-227.
- (1988) Journal of the Royal Statistical Society. Series B. Methodological , vol.50 , Issue.2 , pp. 154-227
- Lauritzen, S.L.¹ Spiegelhalter, D.J.²

18
- 77956951736
- LibDAI: A free and open source C++ library for discrete approximate inference in graphical models
- Mooij, J. M. (2010). libDAI: A free and open source C++ library for discrete approximate inference in graphical models. Journal of Machine Learning Research, 11, 2169-2173.
- (2010) Journal of Machine Learning Research , vol.11 , pp. 2169-2173
- Mooij, J.M.¹

19
- 0002425879
- Loopy belief propagation for approximate inference: An empirical study
- San Francisco: Morgan Kaufmann
- Murphy, K., Weiss, Y., & Jordan, M. (1999). Loopy belief propagation for approximate inference: An empirical study. In Proceedings of the 15th conference on uncertainty in artificial intelligence (UAI'99) (pp. 467-475). San Francisco: Morgan Kaufmann.
- (1999) Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence (UAI'99) , pp. 467-475
- Murphy, K.¹ Weiss, Y.² Jordan, M.³

20
- 77958569725
- Relative entropy policy search
- Menlo Park: AAAI Press
- Peters, J., Mülling, K., & Altün, Y. (2010). Relative entropy policy search. In Proceedings of the 24th AAAI conference on artificial intelligence (AAAI 2010) (pp. 1607-1612). Menlo Park: AAAI Press.
- (2010) Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI 2010) , pp. 1607-1612
- Peters, J.¹ Mülling, K.² Altün, Y.³

21
- 0003584577
- Upper Saddle River: Prentice-Hall, Inc
- Russell, S. J., Norvig, P., Candy, J. F., Malik, J. M., & Edwards, D. D. (1996). Artificial intelligence: a modern approach. Upper Saddle River: Prentice-Hall, Inc.
- (1996) Artificial Intelligence: A Modern Approach
- Russell, S.J.¹ Norvig, P.² Candy, J.F.³ Malik, J.M.⁴ Edwards, D.D.⁵

22
- 0039355728
- Decision making using probabilistic inference methods
- San Francisco: Morgan Kaufmann
- Shachter, R. D., & Peot, M. A. (1992). Decision making using probabilistic inference methods. In Proceedings of the 8th conference on uncertainty in artificial intelligence (UAI'92) (pp. 276-283). San Francisco: Morgan Kaufmann.
- (1992) Proceedings of the 8th Conference on Uncertainty in Artificial Intelligence (UAI'92) , pp. 276-283
- Shachter, R.D.¹ Peot, M.A.²

23
- 84953296993
- Cambridge: Cambridge University Press
- Skyrms, B. (1996). Evolution of the social contract. Cambridge: Cambridge University Press.
- (1996) Evolution of the Social Contract
- Skyrms, B.¹

24
- 84924503282
- Cambridge: Cambridge University Press
- Skyrms, B. (Ed.) (2004). The stag hunt and evolution of social structure. Cambridge: Cambridge University Press.
- (2004) The Stag Hunt and Evolution of Social Structure
- Skyrms, B.¹

25
- 0004294973
- New York: Dover Publications, Inc
- Stengel, R. F. (1994). Optimal control and estimation. New York: Dover Publications, Inc.
- (1994) Optimal Control and Estimation
- Stengel, R.F.¹

26
- 0025399873
- Dynamic programming and influence diagrams
- DOI 10.1109/21.52548
- Tatman, J., & Shachter, R. (1990). Dynamic programming and influence diagrams. IEEE Transactions on Systems, Man, and Cybernetics, 20(2), 365-379. (Pubitemid 20702819)
- (1990) IEEE Transactions on Systems, Man and Cybernetics , vol.20 , Issue.2 , pp. 365-379
- Tatman Joseph, A.¹ Shachter Ross, D.²

27
- 67650458713
- Path integral-based stochastic optimal control for rigid body dynamics
- Theodorou, E. A., Buchli, J., & Schaal, S. (2009). Path integral-based stochastic optimal control for rigid body dynamics. In Adaptive dynamic programming and reinforcement learning, 2009. ADPRL'09. IEEE symposium on (pp. 219-225).
- (2009) Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL'09. IEEE Symposium on , pp. 219-225
- Theodorou, E.A.¹ Buchli, J.² Schaal, S.³

28
- 84862011769
- Learning policy improvements with path integrals
- Theodorou, E. A., Buchli, J., & Schaal, S. (2010a). Learning policy improvements with path integrals. In International conference on artificial intelligence and statistics (AISTATS 2010).
- (2010) International Conference on Artificial Intelligence and Statistics (AISTATS 2010)
- Theodorou, E.A.¹ Buchli, J.² Schaal, S.³

29
- 77955836276
- Reinforcement learning of motor skills in high dimensions: A path integral approach
- New York: IEEE Press
- Theodorou, E. A., Buchli, J., & Schaal, S. (2010b). Reinforcement learning of motor skills in high dimensions: A path integral approach. In Proceedings of the international conference on robotics and automation (ICRA 2010) (pp. 2397-2403). New York: IEEE Press.
- (2010) Proceedings of the International Conference on Robotics and Automation (ICRA 2010) , pp. 2397-2403
- Theodorou, E.A.¹ Buchli, J.² Schaal, S.³

30
- 84864055301
- Linearly-solvable Markov decision problems
- Cambridge: MIT Press
- Todorov, E. (2007). Linearly-solvable Markov decision problems. In Advances in neural information processing systems (Vol. 19, pp. 1369-1376). Cambridge: MIT Press.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 1369-1376
- Todorov, E.¹

31
- 62949148891
- General duality between optimal control and estimation
- Todorov, E. (2008). General duality between optimal control and estimation. In 47th IEEE conference on decision and control (pp. 4286-4292).
- (2008) 47th IEEE Conference on Decision and Control , pp. 4286-4292
- Todorov, E.¹

32
- 67650915125
- Efficient computation of optimal actions
- Todorov, E. (2009). Efficient computation of optimal actions. Proceedings of the National Academy of Sciences of the United States of America, 106(28), 11478-11483.
- (2009) Proceedings of the National Academy of Sciences of the United States of America , vol.106 , Issue.28 , pp. 11478-11483
- Todorov, E.¹

33
- 33749234798
- Probabilistic inference for solving discrete and continuous state Markov decision processes
- New York: ACM
- Toussaint, M., & Storkey, A. (2006). Probabilistic inference for solving discrete and continuous state Markov decision processes. In ICML'06: Proceedings of the 23rd international conference on machine learning (pp. 945-952). New York: ACM.
- (2006) ICML'06: Proceedings of the 23rd International Conference on Machine Learning , pp. 945-952
- Toussaint, M.¹ Storkey, A.²

34
- 52249107868
- Graphical model inference in optimal control of stochastic multi-agent systems
- van den Broek, B., Wiegerinck, W., & Kappen, H. J. (2008a). Graphical model inference in optimal control of stochastic multi-agent systems. Journal of Artificial Intelligence Research, 32(1), 95-122.
- (2008) Journal of Artificial Intelligence Research , vol.32 , Issue.1 , pp. 95-122
- Van Den Broek, B.¹ Wiegerinck, W.² Kappen, H.J.³

35
- 49949091811
- Optimal control in large stochastic multi-agent systems. Adaptive Agents and Multi-Agent Systems III
- van den Broek, B., Wiegerinck, W., & Kappen, H. J. (2008b). Optimal control in large stochastic multi-agent systems. Adaptive Agents and Multi-Agent Systems III. Adaptation and Multi-Agent Learning, 4865, 15-26.
- (2008) Adaptation and Multi-agent Learning , vol.4865 , pp. 15-26
- Van Den Broek, B.¹ Wiegerinck, W.² Kappen, H.J.³

36
- 49949095696
- Stochastic optimal control in continuous spacetime multi-agent systems
- Arlington, Virginia, Corvallis: AUAI Press
- Wiegerinck, W., van den Broek, B., & Kappen, H. J. (2006). Stochastic optimal control in continuous spacetime multi-agent systems. In Proceedings of the 22nd conference on uncertainty in artificial intelligence (UAI'06), Arlington, Virginia (pp. 528-535). Corvallis: AUAI Press.
- (2006) Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence (UAI'06) , pp. 528-535
- Wiegerinck, W.¹ Van Den Broek, B.² Kappen, H.J.³

37
- 60349096142
- Optimal on-line scheduling in stochastic multiagent systems in continuous space and time
- Wiegerinck, W., van den Broek, B., & Kappen, H. J. (2007). Optimal on-line scheduling in stochastic multiagent systems in continuous space and time. In Proceedings of the 6th international joint conference on autonomous agents and multiagent systems AAMAS 07 (pp. 749-756).
- (2007) Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems AAMAS 07 , pp. 749-756
- Wiegerinck, W.¹ Van Den Broek, B.² Kappen, H.J.³

38
- 84898975095
- Generalized belief propagation
- T. K. Leen, T. G. Dieterich, & V. Tresp Eds., Cambridge: MIT Press
- Yedidia, J., Freeman, W., & Weiss, Y. (2001). Generalized belief propagation. In T. K. Leen, T. G. Dieterich, & V. Tresp (Eds.), Advances in neural information processing systems (Vol. 13, pp. 689-995). Cambridge: MIT Press.
- (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 689-995
- Yedidia, J.¹ Freeman, W.² Weiss, Y.³

39
- 23744513375
- Constructing free-energy approximations and generalized belief propagation algorithms
- DOI 10.1109/TIT.2005.850085
- Yedidia, J., Freeman, W., & Weiss, Y. (2005). Constructing free-energy approximations and generalized belief propagation algorithms. IEEE Transactions on Information Theory, 51(7), 2282-2312. (Pubitemid 41136394)
- (2005) IEEE Transactions on Information Theory , vol.51 , Issue.7 , pp. 2282-2312
- Yedidia, J.S.¹ Freeman, W.T.² Weiss, Y.³

40
- 58149154664
- Game theory of mind
- Yoshida, W., Dolan, R. J., & Friston, K. J. (2008). Game theory of mind. PLoS Computational Biology, 4(12), e1000254.
- (2008) PLoS Computational Biology , vol.4 , Issue.12
- Yoshida, W.¹ Dolan, R.J.² Friston, K.J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.