Volume 87, Issue 2, 2012, Pages 159-182

Optimal control as a graphical model inference problem

Author keywords

Approximate inference; Belief propagation; Cluster variation method; Graphical model; Kullback-Leibler divergence; Optimal control; Uncontrolled dynamics

Indexed keywords

APPROXIMATE INFERENCE; BELIEF PROPAGATION; CLUSTER VARIATION METHODS; GRAPHICAL MODEL; KULLBACK LEIBLER DIVERGENCE; OPTIMAL CONTROLS; UNCONTROLLED DYNAMICS

EID: 84862024986     PISSN: 08856125     EISSN: 15730565     Source Type: Journal    
DOI: 10.1007/s10994-012-5278-7     Document Type: Article
Times cited: 344

References (40)
  • 1
    • 35548937764
    • Haplotype inference in general pedigrees using the cluster variation method
    • Albers, C. A., Heskes, T., & Kappen, H. J. (2007). Haplotype inference in general pedigrees using the cluster variation method. Genetics, 177(2), 1101-1118.
    • (2007) Genetics , vol.177 , Issue.2 , pp. 1101-1118
    • Albers, C.A.1    Heskes, T.2    Kappen, H.J.3
  • 2
    • 33947331887
    • The cluster variation method for efficient linkage analysis on extended pedigrees
    • Albers, C. A., Leisink, M. A. R., & Kappen, H. J. (2006). The cluster variation method for efficient linkage analysis on extended pedigrees. BMC Bioinformatics, 7 (S-1).
    • (2006) BMC Bioinformatics , vol.7 , Issue.S-1
    • Albers, C.A.1    Leisink, M.A.R.2    Kappen, H.J.3
  • 8
    • 70349666986
    • Linear Bellman combination for control of character animation
    • da Silva, M., Durand, F., & Popović, J. (2009). Linear Bellman combination for control of character animation. ACM Transactions on Graphics, 28(3), 82:1-82:10.
    • (2009) ACM Transactions on Graphics , vol.28 , Issue.3 , pp. 82:1-82:10
    • Da Silva, M.1    Durand, F.2    Popović, J.3
  • 9
    • 0346982426
    • Using expectation-maximization for reinforcement learning
    • Dayan, P., & Hinton, G. E. (1997). Using expectation-maximization for reinforcement learning. Neural Computation, 9(2), 271-278. (Pubitemid 127635391)
    • (1997) Neural Computation , vol.9 , Issue.2 , pp. 271-278
    • Dayan, P.1    Hinton, G.E.2
  • 10
    • 68149131857
    • Reinforcement learning or active inference?
    • Friston, K. J., Daunizeau, J., & Kiebel, S. J. (2009). Reinforcement learning or active inference? PLoS ONE, 4(7), e6421.
    • (2009) PLoS ONE , vol.4 , Issue.7
    • Friston, K.J.1    Daunizeau, J.2    Kiebel, S.J.3
  • 13
    • 28844435646
    • Linear theory for control of nonlinear stochastic systems
    • Kappen, H. J. (2005). Linear theory for control of nonlinear stochastic systems. Physical Review Letters, 95(20), 200201.
    • (2005) Physical Review Letters , vol.95 , Issue.20 , pp. 200201
    • Kappen, H.J.1
  • 14
    • 84898961717
    • Novel iteration schemes for the cluster variation method
    • Cambridge: MIT Press
    • Kappen, H. J., & Wiegerinck, W. (2002). Novel iteration schemes for the cluster variation method. In Advances in neural information processing systems (Vol. 14, pp. 415-422). Cambridge: MIT Press.
    • (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 415-422
    • Kappen, H.J.1    Wiegerinck, W.2
  • 15
    • 78049390740
    • Policy search for motor primitives in robotics
    • Kober, J., & Peters, J. (2011). Policy search for motor primitives in robotics. Machine Learning, 84(1-2), 171-203.
    • (2011) Machine Learning , vol.84 , Issue.1-2 , pp. 171-203
    • Kober, J.1    Peters, J.2
  • 18
    • 77956951736
    • libDAI: A free and open source C++ library for discrete approximate inference in graphical models
    • Mooij, J. M. (2010). libDAI: A free and open source C++ library for discrete approximate inference in graphical models. Journal of Machine Learning Research, 11, 2169-2173.
    • (2010) Journal of Machine Learning Research , vol.11 , pp. 2169-2173
    • Mooij, J.M.1
  • 30
    • 84864055301
    • Linearly-solvable Markov decision problems
    • Cambridge: MIT Press
    • Todorov, E. (2007). Linearly-solvable Markov decision problems. In Advances in neural information processing systems (Vol. 19, pp. 1369-1376). Cambridge: MIT Press.
    • (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 1369-1376
    • Todorov, E.1
  • 31
    • 62949148891
    • General duality between optimal control and estimation
    • Todorov, E. (2008). General duality between optimal control and estimation. In 47th IEEE conference on decision and control (pp. 4286-4292).
    • (2008) 47th IEEE Conference on Decision and Control , pp. 4286-4292
    • Todorov, E.1
  • 35
    • 49949091811
    • Optimal control in large stochastic multi-agent systems
    • van den Broek, B., Wiegerinck, W., & Kappen, H. J. (2008b). Optimal control in large stochastic multi-agent systems. Adaptive Agents and Multi-Agent Systems III. Adaptation and Multi-Agent Learning, 4865, 15-26.
    • (2008) Adaptation and Multi-agent Learning , vol.4865 , pp. 15-26
    • Van Den Broek, B.1    Wiegerinck, W.2    Kappen, H.J.3
  • 38
    • 84898975095
    • Generalized belief propagation
    • T. K. Leen, T. G. Dietterich, & V. Tresp (Eds.), Cambridge: MIT Press
    • Yedidia, J., Freeman, W., & Weiss, Y. (2001). Generalized belief propagation. In T. K. Leen, T. G. Dietterich, & V. Tresp (Eds.), Advances in neural information processing systems (Vol. 13, pp. 689-695). Cambridge: MIT Press.
    • (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 689-695
    • Yedidia, J.1    Freeman, W.2    Weiss, Y.3
  • 39
    • 23744513375
    • Constructing free-energy approximations and generalized belief propagation algorithms
    • DOI 10.1109/TIT.2005.850085
    • Yedidia, J., Freeman, W., & Weiss, Y. (2005). Constructing free-energy approximations and generalized belief propagation algorithms. IEEE Transactions on Information Theory, 51(7), 2282-2312. (Pubitemid 41136394)
    • (2005) IEEE Transactions on Information Theory , vol.51 , Issue.7 , pp. 2282-2312
    • Yedidia, J.S.1    Freeman, W.T.2    Weiss, Y.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.