SCOPUS 정보 검색 플랫폼

Proceedings of the 11th Annual Genetic and Evolutionary Computation Conference, GECCO-2009

Volumn , Issue , 2009, Pages 1211-1218

Uncertainty handling CMA-ES for reinforcement learning

(2) Heidrich Meisner, Verena a Igel, Christian a

a RUHR UNIVERSITY BOCHUM (Germany)

Author keywords

Covariance matrix adaptation evolution strategy; Direct policy search; Reinforcement learning; Uncertainty handling

Indexed keywords

COVARIANCE MATRIX ADAPTATION EVOLUTION STRATEGIES; DIRECT POLICY SEARCH; EVOLUTIONARY LEARNING; LEARNING SPEED; NOISY OBSERVATIONS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; POLICY GRADIENT METHODS; RANDOM SEARCHES; UNCERTAINTY HANDLING;

EDUCATION; EVOLUTIONARY ALGORITHMS; GRADIENT METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; REINFORCEMENT; REINFORCEMENT LEARNING; SIGNAL TO NOISE RATIO;

COVARIANCE MATRIX;

EID: 72749104931 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1569901.1570064 Document Type: Conference Paper

Times cited : (15)

References (28)

1
- 72749092976
- S. Amari and H. Nagaoka. Methods of Information Geometry. Number 191 in Translations of Mathematical Monographs. American Mathematical Society and Oxford University Press, 2000.
- S. Amari and H. Nagaoka. Methods of Information Geometry. Number 191 in Translations of Mathematical Monographs. American Mathematical Society and Oxford University Press, 2000.

2
- 0013078349
- Kluwer Academic Publishers
- D. V. Arnold. Noisy Optimization With Evolution Strategies. Kluwer Academic Publishers, 2002.
- (2002) Noisy Optimization With Evolution Strategies
- Arnold, D.V.¹

3
- 56449128627
- Evolution strategies
- H.-G. Beyer. Evolution strategies. Scholarpedia, 2(8):1965, 2007.
- (1965) Scholarpedia , vol.2 , Issue.8
- Beyer, H.-G.¹

4
- 33746327619
- An alternative simulation budget allocation scheme for efficient simulation
- 49-57
- C. Chen and E. Yucesan. An alternative simulation budget allocation scheme for efficient simulation. International Journal of Simulation and Process Modeling, 1(1):49-57, 2005.
- (2005) International Journal of Simulation and Process Modeling , vol.1 , Issue.1
- Chen, C.¹ Yucesan, E.²

5
- 34548132703
- These de doctorat, Institut National Polytechnique de Grenoble
- R. Coulom. Apprentissage par renforcement utilisant des reseaux de neurones, avec des applications au controle moteur. These de doctorat, Institut National Polytechnique de Grenoble, 2002.
- (2002) Apprentissage par renforcement utilisant des reseaux de neurones, avec des applications au controle moteur
- Coulom, R.¹

6
- 56449125243
- Uncertainty handling in model selection for support vector machines
- G. Rudolph, editor, Parallel Problem Solving from Nature PPSN X, of, Springer-Verlag
- T. Glasmachers and C. Igel. Uncertainty handling in model selection for support vector machines. In G. Rudolph, editor, Parallel Problem Solving from Nature (PPSN X), volume 5199 of LNCS, pages 185-194. Springer-Verlag, 2008.
- (2008) LNCS , vol.5199 , pp. 185-194
- Glasmachers, T.¹ Igel, C.²

7
- 44649193889
- Accelerated neural evolution through cooperatively coevolved synapses
- F. Gomez, J. Schmidhuber, and R. Miikkulainen. Accelerated neural evolution through cooperatively coevolved synapses. Journal of Machine Learning Research, 9:937-965, 2008.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 937-965
- Gomez, F.¹ Schmidhuber, J.² Miikkulainen, R.³

8
- 0042879997
- Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES)
- N. Hansen, S. D. Müller, and P. Koumoutsakos. Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES). Evolutionary Computation, 11(1):1-18, 2003.
- (2003) Evolutionary Computation , vol.11 , Issue.1 , pp. 1-18
- Hansen, N.¹ Müller, S.D.² Koumoutsakos, P.³

9
- 56449130836
- Evolutionary optimization of feedback controllers for thermoacoustic instabilities
- J. F. Morrison, D. M. Birch, and P. Lavoie, editors, Springer-Verlag
- N. Hansen, A. S. P. Niederberger, L. Guzzella, and P. Koumoutsakos. Evolutionary optimization of feedback controllers for thermoacoustic instabilities. In J. F. Morrison, D. M. Birch, and P. Lavoie, editors, IUTAM Symposium on Flow Control and MEMS. Springer-Verlag, 2008.
- (2008) IUTAM Symposium on Flow Control and MEMS
- Hansen, N.¹ Niederberger, A.S.P.² Guzzella, L.³ Koumoutsakos, P.⁴

10
- 59749085404
- A method for handling uncertainty in evolutionary optimization with an application to feedback control of combustion
- N. Hansen, A. S. P. Niederberger, L. Guzzella, and P. Koumoutsakos. A method for handling uncertainty in evolutionary optimization with an application to feedback control of combustion. IEEE Transactions on Evolutionary Computation, 13(1):180-197, 2009.
- (2009) IEEE Transactions on Evolutionary Computation , vol.13 , Issue.1 , pp. 180-197
- Hansen, N.¹ Niederberger, A.S.P.² Guzzella, L.³ Koumoutsakos, P.⁴

11
- 0035377566
- Completely derandomized self-adaptation in evolution strategies
- N. Hansen and A. Ostermeier. Completely derandomized self-adaptation in evolution strategies. Evolutionary Computation, 9(2):159-195, 2001.
- (2001) Evolutionary Computation , vol.9 , Issue.2 , pp. 159-195
- Hansen, N.¹ Ostermeier, A.²

12
- 56449106904
- Evolution strategies for direct policy search
- G. Rudolph, editor, Parallel Problem Solving from Nature PPSN X, number in, Springer-Verlag
- V. Heidrich-Meisner and C. Igel. Evolution strategies for direct policy search. In G. Rudolph, editor, Parallel Problem Solving from Nature (PPSN X), number 5199 in LNCS, pages 428-437. Springer-Verlag, 2008.
- (2008) LNCS , vol.5199 , pp. 428-437
- Heidrich-Meisner, V.¹ Igel, C.²

13
- 72749121896
- Uncertainty handling in evolutionary direct policy search
- Y. Engel, M. Ghavamzadeh, P. Poupart, and S. Mannor, editors
- V. Heidrich-Meisner and C. Igel. Uncertainty handling in evolutionary direct policy search. In Y. Engel, M. Ghavamzadeh, P. Poupart, and S. Mannor, editors, NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning. 2008.
- (2008) NIPS-08 Workshop on Model Uncertainty and Risk in Reinforcement Learning
- Heidrich-Meisner, V.¹ Igel, C.²

14
- 58449122813
- Variable metric reinforcement learning methods applied to the noisy mountain car problem
- S. Girgin et al, editors, European Workshop on Reinforcement Learning EWRL 2008, number in, Springer-Verlag
- V. Heidrich-Meisner and C. Igel. Variable metric reinforcement learning methods applied to the noisy mountain car problem. In S. Girgin et al., editors, European Workshop on Reinforcement Learning (EWRL 2008), number 5323 in LNAI, pages 136-150. Springer-Verlag, 2008.
- (2008) LNAI , vol.5323 , pp. 136-150
- Heidrich-Meisner, V.¹ Igel, C.²

15
- 84901411269
- Neuroevolution for reinforcement learning using evolution strategies
- IEEE Press
- C. Igel. Neuroevolution for reinforcement learning using evolution strategies. In Congress on Evolutionary Computation (CEC 2003), volume 4, pages 2588-2595. IEEE Press, 2003.
- (2003) Congress on Evolutionary Computation (CEC 2003) , vol.4 , pp. 2588-2595
- Igel, C.¹

16
- 46249090365
- Shark
- C. Igel, T. Glasmachers, and V. Heidrich-Meisner. Shark. Journal of Machine Learning Research, 9:993-996, 2008.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 993-996
- Igel, C.¹ Glasmachers, T.² Heidrich-Meisner, V.³

17
- 0032073263
- Planning and acting in partially observable stochastic domains
- L. Kaelbling, M. Littman, and A. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.¹ Littman, M.² Cassandra, A.³

18
- 84898930479
- A natural policy gradient
- T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, MIT Press
- S. Kakade. A natural policy gradient. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural Information Processing Systems (NIPS14). MIT Press, 2002.
- (2002) Advances in Neural Information Processing Systems (NIPS14)
- Kakade, S.¹

19
- 17444408553
- Making driver modeling attractive
- A. Pellecchia, C. Igel, J. Edelbrunner, and G. Schöner. Making driver modeling attractive. IEEE Intelligent Systems, 20(2):8-12, 2005.
- (2005) IEEE Intelligent Systems , vol.20 , Issue.2 , pp. 8-12
- Pellecchia, A.¹ Igel, C.² Edelbrunner, J.³ Schöner, G.⁴

20
- 40649106649
- Natural actor-critic
- J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
- (2008) Neurocomputing , vol.71 , Issue.7-9 , pp. 1180-1190
- Peters, J.¹ Schaal, S.²

21
- 34447553096
- Reinforcement learning for humanoid robotics
- J. Peters, S. Vijayakumar, and S. Schaal. Reinforcement learning for humanoid robotics. In Proc. 3rd IEEE-RAS Int'l Conf. on Humanoid Robots, pages 29-30, 2003.
- (2003) Proc. 3rd IEEE-RAS Int'l Conf. on Humanoid Robots , pp. 29-30
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

22
- 34548763245
- Evaluation of policy gradient methods and variants on the cart-pole benchmark
- M. Riedmiller, J. Peters, and S. Schaal. Evaluation of policy gradient methods and variants on the cart-pole benchmark. In Proc. IEEE Int'l Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007), pages 254-261, 2007.
- (2007) Proc. IEEE Int'l Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007) , pp. 254-261
- Riedmiller, M.¹ Peters, J.² Schaal, S.³

23
- 33745783272
- Integrating techniques from statistical ranking into evolutionary algorithms
- Applications of Evolutionary Computing, of, Springer
- C. Schmidt, J. Branke, and S. Chick. Integrating techniques from statistical ranking into evolutionary algorithms. In Applications of Evolutionary Computing, volume 3907 of LNCS, pages 752-763. Springer, 2006.
- (2006) LNCS , vol.3907 , pp. 752-763
- Schmidt, C.¹ Branke, J.² Chick, S.³

24
- 55749091103
- Evolutionary reinforcement learning of artificial neural networks
- N. T. Siebel and G. Sommer. Evolutionary reinforcement learning of artificial neural networks. International Journal of Hybrid Intelligent Systems, 4(3):171-183, 2007.
- (2007) International Journal of Hybrid Intelligent Systems , vol.4 , Issue.3 , pp. 171-183
- Siebel, N.T.¹ Sommer, G.²

25
- 0004102479
- MIT Press
- R. Sutton and A. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

26
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- R. Sutton, D. McAllester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, volume 12, pages 1057-1063, 2000.
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

27
- 62149099100
- Efficient covariance matrix update for variable metric evolution strategies
- T. Suttorp, N. Hansen, and C. Igel. Efficient covariance matrix update for variable metric evolution strategies. Machine Learning, 75(2):167-197, 2009.
- (2009) Machine Learning , vol.75 , Issue.2 , pp. 167-197
- Suttorp, T.¹ Hansen, N.² Igel, C.³

28
- 33646714634
- Evolutionary function approximation for reinforcement learning
- S. Whiteson and P. Stone. Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 7:877-917, 2006.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 877-917
- Whiteson, S.¹ Stone, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.