-
2
-
-
48349140736
-
Rollout sampling approximate policy iteration
-
September
-
Christos Dimitrakakis and Michail G. Lagoudakis. Rollout sampling approximate policy iteration. Machine Learning, 72(3), September 2008.
-
(2008)
Machine Learning
, vol.72
, Issue.3
-
-
Dimitrakakis, C.1
Lagoudakis, M.G.2
-
4
-
-
77955839714
-
Learning motor primitives in robotics
-
D. Schuurmans, Y. Bengio, and D. Koller, editors, Cambridge, MA: MIT Press.
-
J. Koeber and J. Peters. Learning motor primitives in robotics. In D. Schuurmans, Y. Bengio, and D. Koller, editors, Advances in Neural Information Processing Systems 21 (NIPS 2008), Vancouver, BC, Dec. 8-11, 2009. Cambridge, MA: MIT Press.
-
Advances in Neural Information Processing Systems 21 (NIPS 2008), Vancouver, BC, Dec. 8-11, 2009
-
-
Koeber, J.1
Peters, J.2
-
7
-
-
61849173491
-
Gaussian Process Dynamic Programming
-
March
-
Marc P. Deisenroth, Carl E. Rasmussen, and Jan Peters. Gaussian Process Dynamic Programming. Neurocomputing, 72(7-9):1508-1524, March 2009.
-
(2009)
Neurocomputing
, vol.72
, Issue.7-9
, pp. 1508-1524
-
-
Deisenroth, M.P.1
Rasmussen, C.E.2
Peters, J.3
-
8
-
-
34547996989
-
Bayesian actor-critic algorithms
-
New York, NY, USA, ACM
-
Mohammad Ghavamzadeh and Yaakov Engel. Bayesian actor-critic algorithms. In ICML '07: Proceedings of the 24th international conference on Machine learning, pages 297-304, New York, NY, USA, 2007. ACM.
-
(2007)
ICML '07: Proceedings of the 24th International Conference on Machine Learning
, pp. 297-304
-
-
Ghavamzadeh, M.1
Engel, Y.2
-
9
-
-
70349327392
-
Learning model-free robot control by a Monte Carlo EM algorithm
-
Nikos Vlassis, Marc Toussaint, Georgios Kontes, and Savas Piperidis. Learning model-free robot control by a Monte Carlo EM algorithm. Auton. Robots, 27(2):123-130, 2009.
-
(2009)
Auton. Robots
, vol.27
, Issue.2
, pp. 123-130
-
-
Vlassis, N.1
Toussaint, M.2
Kontes, G.3
Piperidis, S.4
-
11
-
-
52249107868
-
Graphical model inference in optimal control of stochastic multi-agent systems
-
B. van den Broek, W. Wiegerinck, and B. Kappen. Graphical model inference in optimal control of stochastic multi-agent systems. Journal of Artificial Intelligence Research, 32(1):95-122, 2008.
-
(2008)
Journal of Artificial Intelligence Research
, vol.32
, Issue.1
, pp. 95-122
-
-
Van Den Broek, B.1
Wiegerinck, W.2
Kappen, B.3
-
12
-
-
84899019754
-
Learning attractor landscapes for learning motor primitives
-
S. Becker, S. Thrun, and K. Obermayer, editors, Cambridge, MA: MIT Press
-
A. Ijspeert, J. Nakanishi, and S. Schaal. Learning attractor landscapes for learning motor primitives. In S. Becker, S. Thrun, and K. Obermayer, editors, Advances in Neural Information Processing Systems 15, pages 1547-1554. Cambridge, MA: MIT Press, 2003.
-
(2003)
Advances in Neural Information Processing Systems 15
, pp. 1547-1554
-
-
Ijspeert, A.1
Nakanishi, J.2
Schaal, S.3
-
14
-
-
0004294973
-
-
Dover books on advanced mathematics. Dover Publications, New York. Originally published: Stochastic optimal control. New York: Wiley, c1986.
-
Robert F. Stengel. Optimal control and estimation. Dover books on advanced mathematics. Dover Publications, New York, 1994. Originally published: Stochastic optimal control. New York: Wiley, c1986.
-
(1994)
Optimal Control and Estimation
-
-
Stengel, R.F.1
-
15
-
-
0003423896
-
-
Springer, New York, 2nd edition
-
Wendell Helms Fleming and H. Mete Soner. Controlled Markov processes and viscosity solutions. Applications of mathematics. Springer, New York, 2nd edition, 2006.
-
(2006)
Controlled Markov Processes and Viscosity Solutions. Applications of Mathematics
-
-
Fleming, W.H.1
Mete Soner, H.2
-
17
-
-
0031341708
-
-
vol.3, Dec
-
Jiongmin Yong. Relations among ODEs, PDEs, FSDEs, BSDEs, and FBSDEs. Volume 3, pages 2779-2784, Dec 1997.
-
(1997)
Relations among ODEs, PDEs, FSDEs, BSDEs, and FBSDEs
, vol.3
, pp. 2779-2784
-
-
Yong, J.1
-
18
-
-
29044440299
-
Path integrals and symmetry breaking for optimal control theory
-
H J Kappen. Path integrals and symmetry breaking for optimal control theory. Journal of Statistical Mechanics: Theory and Experiment, 2005(11):P11011, 2005.
-
(2005)
Journal of Statistical Mechanics: Theory and Experiment
, vol.2005
, Issue.11
, pp. 11011
-
-
Kappen, H.J.1
-
19
-
-
33947410345
-
An introduction to stochastic control theory, path integrals and reinforcement learning
-
J. Marro, P. L. Garrido, and J. J. Torres, editors, Cooperative Behavior in Neural Systems, February
-
H. J. Kappen. An introduction to stochastic control theory, path integrals and reinforcement learning. In J. Marro, P. L. Garrido, and J. J. Torres, editors, Cooperative Behavior in Neural Systems, volume 887 of American Institute of Physics Conference Series, pages 149-181, February 2007.
-
(2007)
American Institute of Physics Conference Series
, vol.887
, pp. 149-181
-
-
Kappen, H.J.1
-
20
-
-
28844435646
-
Linear theory for control of nonlinear stochastic systems
-
Nov
-
Hilbert J. Kappen. Linear theory for control of nonlinear stochastic systems. Phys. Rev. Lett., 95(20):200201, Nov 2005.
-
(2005)
Phys. Rev. Lett.
, vol.95
, Issue.20
, pp. 200201
-
-
Kappen, H.J.1
-
24
-
-
67650915125
-
Efficient computation of optimal actions
-
Emanuel Todorov. Efficient computation of optimal actions. Proc Natl Acad Sci U S A, 106(28):11478-83.
-
Proc Natl Acad Sci U S A
, vol.106
, Issue.28
, pp. 11478-11483
-
-
Todorov, E.1
-
25
-
-
28844435646
-
Linear theory for control of nonlinear stochastic systems
-
-
H. J. Kappen. Linear theory for control of nonlinear stochastic systems. Phys Rev Lett, 95(20):200201, 2005.
-
(2005)
Phys Rev Lett
, vol.95
, Issue.20
, pp. 200201
-
-
Kappen, H.J.1
-
26
-
-
49949095696
-
Stochastic optimal control in continuous space-time multi-agent system
-
W. Wiegerinck, B. van den Broek, and H. J. Kappen. Stochastic optimal control in continuous space-time multi-agent system. In UAI, 2006.
-
(2006)
UAI
-
-
Wiegerinck, W.1
Van Den Broek, B.2
Kappen, H.J.3