SCOPUS 정보 검색 플랫폼

Proceedings of the 29th International Conference on Machine Learning, ICML 2012

Volumn 1, Issue , 2012, Pages 281-288

Path integral policy improvement with covariance matrix adaptation

(2) Stulp, Freek a,b Sigaud, Olivier c

a ENSTA PARISTECH (France)

b INRIA (France)

c UNIVERSITÉ PIERRE ET MARIE CURIE (France)

Author keywords

[No Author keywords available]

Indexed keywords

CONCEPTUAL LEVELS; CONTINUOUS STATE; COVARIANCE MATRIX ADAPTATION; CROSS-ENTROPY METHOD; NOVEL ALGORITHM; PARAMETERIZED; PATH INTEGRAL; STATISTICAL ESTIMATION; STOCHASTIC OPTIMAL CONTROL;

ALGORITHMS; OPTIMIZATION; QUANTUM THEORY; REINFORCEMENT LEARNING;

ITERATIVE METHODS;

EID: 84867129779 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (132)

References (14)

1
- 79551686776
- Cross-entropy optimization of control policies with adaptive basis functions
- Busoniu, L., Ernst, D., Schutter, B. De, and Babuska, R. Cross-entropy optimization of control policies with adaptive basis functions. IEEE Transactions on Systems, Man, and Cybernetics, 41(1):196-209, 2011.
- (2011) IEEE Transactions on Systems, Man, and Cybernetics , vol.41 , Issue.1 , pp. 196-209
- Busoniu, L.¹ Ernst, D.² De Schutter, B.³ Babuska, R.⁴

2
- 33749260658
- Meboo Publishing USA
- Dattorro, J. Convex Optimization & Euclidean Distance Geometry. Meboo Publishing USA, 2011.
- (2011) Convex Optimization & Euclidean Distance Geometry
- Dattorro, J.¹

3
- 0035377566
- Completely derandomized self-adaptation in evolution strategies
- Hansen, N. and Ostermeier, A. Completely derandomized self-adaptation in evolution strategies. Evolutionary Computation, 9(2):159-195, 2001.
- (2001) Evolutionary Computation , vol.9 , Issue.2 , pp. 159-195
- Hansen, N.¹ Ostermeier, A.²

4
- 77952322771
- Evolution strategies for direct policy search
- Heidrich-Meisnerm V. and Igel, C. Evolution strategies for direct policy search. In Proc. of the Int'l Conference on Parallel Problem Solving from Nature, 2008.
- Proc. of the Int'l Conference on Parallel Problem Solving from Nature, 2008
- Heidrich-Meisnerm, V.¹ Igel, C.²

5
- 0036059542
- Movement imitation with nonlinear dynamical systems in humanoid robots
- Ijspeert, A. J., Nakanishi, J., and Schaal, S. Movement imitation with nonlinear dynamical systems in humanoid robots. In Proc. of the IEEE Int'l Conference on Robotics and Automation (ICRA), 2002.
- Proc. of the IEEE Int'l Conference on Robotics and Automation (ICRA), 2002
- Ijspeert, A.J.¹ Nakanishi, J.² Schaal, S.³

6
- 78049390740
- Policy search for motor primitives in robotics
- Kober, J. and Peters, J. Policy search for motor primitives in robotics. Machine Learning, 84:171-203, 2011.
- (2011) Machine Learning , vol.84 , pp. 171-203
- Kober, J.¹ Peters, J.²

7
- 84866005469
- Cross-entropy randomized motion planning
- Kobilarov, M. Cross-entropy randomized motion planning. In Proceedings of Robotics: Science and Systems, 2011.
- Proceedings of Robotics: Science and Systems, 2011
- Kobilarov, M.¹

8
- 1942516890
- The Cross-Entropy Method for fast policy search
- Mannor, S., Rubinstein, R. Y., and Gat, Y. The Cross-Entropy Method for fast policy search. In Proceedings of the Int'l Conference on Machine Learning, 2003.
- Proceedings of the Int'l Conference on Machine Learning, 2003
- Mannor, S.¹ Rubinstein, R.Y.² Gat, Y.³

9
- 84860394829
- Learning cost-efficient control policies with XCSF: Generalization capabilities and further improvement
- Marin, D., Decock, J., Rigoux, L., and Sigaud, O. Learning cost-efficient control policies with XCSF: Generalization capabilities and further improvement. In Proc. of Genetic and evolutionary computation, 2011.
- Proc. of Genetic and Evolutionary Computation, 2011
- Marin, D.¹ Decock, J.² Rigoux, L.³ Sigaud, O.⁴

10
- 40649106649
- Natural actor-critic
- Peters, J. and Schaal, S.. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
- (2008) Neurocomputing , vol.71 , Issue.7-9 , pp. 1180-1190
- Peters, J.¹ Schaal, S.²

11
- 85141643084
- Exploring parameter space in reinforcement learning
- Rückstiess, T., Sehnke, F., Schaul, T., Wierstra, D., Sun, Y., and Schmidhuber, J.. Exploring parameter space in reinforcement learning. Paladyn. Journal of Behavioral Robotics, 1:14-24, 2010.
- (2010) Paladyn. Journal of Behavioral Robotics , vol.1 , pp. 14-24
- Rückstiess, T.¹ Sehnke, F.² Schaul, T.³ Wierstra, D.⁴ Sun, Y.⁵ Schmidhuber, J.⁶

12
- 84870910181
- Learning to grasp under uncertainty
- Stulp, F., Theodorou, E., Buchli, J., and Schaal, S. Learning to grasp under uncertainty. In Proceedings of the Int'l Conference on Robotics and Automation, 2011.
- Proceedings of the Int'l Conference on Robotics and Automation, 2011
- Stulp, F.¹ Theodorou, E.² Buchli, J.³ Schaal, S.⁴

13
- 79551503171
- A generalized path integral control approach to reinforcement learning
- Theodorou, E., Buchli, J., and Schaal, S.. A generalized path integral control approach to reinforcement learning. J. of Machine Learning Research, 11:3137-3181, 2010.
- (2010) J. of Machine Learning Research , vol.11 , pp. 3137-3181
- Theodorou, E.¹ Buchli, J.² Schaal, S.³

14
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams, R. J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.