SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 5323 LNAI, Issue , 2008, Pages 136-150

Variable metric reinforcement learning methods applied to the noisy mountain car problem

(2) Heidrich Meisner, Verena a Igel, Christian a

a RUHR UNIVERSITY BOCHUM (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

COVARIANCE MATRIX; EDUCATION; EVOLUTIONARY ALGORITHMS; LANDFORMS; LEARNING ALGORITHMS; REINFORCEMENT LEARNING;

COVARIANCE MATRIX ADAPTATION (CMA); EVOLUTION STRATEGY (ES); REINFORCEMENT LEARNING (RL) METHODS; VARIABLE METRIC;

REINFORCEMENT;

EID: 58449122813 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-89722-4_11 Document Type: Conference Paper

Times cited : (19)

References (23)

1
- 84886993021
- Heidrich-Meisner, V., Igel, C.: Similarities and differences between policy gradient methods and evolution strategies. In: Verleysen, M. (ed.) 16th European Symposium on Artificial Neural Networks (ESANN), Evere, Belgium, pp. 149-154. d-side publications (2008)
- Heidrich-Meisner, V., Igel, C.: Similarities and differences between policy gradient methods and evolution strategies. In: Verleysen, M. (ed.) 16th European Symposium on Artificial Neural Networks (ESANN), Evere, Belgium, pp. 149-154. d-side publications (2008)

2
- 34447553096
- Reinforcement learning for humanoid robotics
- Peters, J., Vijayakumar, S., Schaal, S.: Reinforcement learning for humanoid robotics. In: Proc. 3rd IEEE-RAS Int'l. Conf. on Humanoid Robots, pp. 29-30 (2003)
- (2003) Proc. 3rd IEEE-RAS Int'l. Conf. on Humanoid Robots , pp. 29-30
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

3
- 34548763245
- Evaluation of policy gradient methods and variants on the cart-pole benchmark
- Riedmiller, M., Peters, J., Schaal, S.: Evaluation of policy gradient methods and variants on the cart-pole benchmark. In: Proc. 2007 IEEE Internatinal Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007), pp. 254-261 (2007)
- (2007) Proc. 2007 IEEE Internatinal Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL , pp. 254-261
- Riedmiller, M.¹ Peters, J.² Schaal, S.³

4
- 84886998125
- Peters, J., Schaal, S.: Applying the episodic natural actor-critic architecture to motor primitive learning. In: Proc. 15th European Symposium on Artificial Neural Networks (ESANN 2007), Evere, Belgium, pp. 1-6. d-side publications (2007)
- Peters, J., Schaal, S.: Applying the episodic natural actor-critic architecture to motor primitive learning. In: Proc. 15th European Symposium on Artificial Neural Networks (ESANN 2007), Evere, Belgium, pp. 1-6. d-side publications (2007)

5
- 40649106649
- Natural actor-critic
- Peters, J., Schaal, S.: Natural actor-critic. Neurocomputing 71(7-9), 1180-1190 (2008)
- (2008) Neurocomputing , vol.71 , Issue.7-9 , pp. 1180-1190
- Peters, J.¹ Schaal, S.²

6
- 33845271655
- Hansen, N.: The CMA evolution strategy: A comparing review. In: Towards a new evolutionary computation. Advances on estimation of distribution algorithms, pp. 75-102. Springer, Heidelberg (2006)
- Hansen, N.: The CMA evolution strategy: A comparing review. In: Towards a new evolutionary computation. Advances on estimation of distribution algorithms, pp. 75-102. Springer, Heidelberg (2006)

7
- 56449128627
- Evolution strategies
- Beyer, H.G.: Evolution strategies. Scholarpedia 2(18), 1965 (2007)
- (2007) Scholarpedia , vol.2 , Issue.18 , pp. 1965
- Beyer, H.G.¹

8
- 84901411269
- Neuroevolution for reinforcement learning using evolution strategies
- IEEE Press, Los Alamitos
- Igel, C.: Neuroevolution for reinforcement learning using evolution strategies. In: Congress on Evolutionary Computation (CEC 2003), vol. 4, pp. 2588-2595. IEEE Press, Los Alamitos (2003)
- (2003) Congress on Evolutionary Computation (CEC , vol.4 , pp. 2588-2595
- Igel, C.¹

9
- 17444408553
- Making driver modeling attractive
- Pellecchia, A., Igel, C., Edelbrunner, J., Schöner, G.: Making driver modeling attractive. IEEE Intelligent Systems 20(2), 8-12 (2005)
- (2005) IEEE Intelligent Systems , vol.20 , Issue.2 , pp. 8-12
- Pellecchia, A.¹ Igel, C.² Edelbrunner, J.³ Schöner, G.⁴

10
- 33750374195
- Gomez, F., Schmidhuber, J., Miikkulainen, R.: Efficient non-linear control through neuroevolution. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS, 4212, pp. 654-662. Springer, Heidelberg (2006)
- Gomez, F., Schmidhuber, J., Miikkulainen, R.: Efficient non-linear control through neuroevolution. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS, vol. 4212, pp. 654-662. Springer, Heidelberg (2006)

11
- 55749091103
- Evolutionary reinforcement learning of artificial neural networks
- Siebel, N.T., Sommer, G.: Evolutionary reinforcement learning of artificial neural networks. International Journal of Hybrid Intelligent Systems 4(3), 171-183 (2007)
- (2007) International Journal of Hybrid Intelligent Systems , vol.4 , Issue.3 , pp. 171-183
- Siebel, N.T.¹ Sommer, G.²

12
- 84887011679
- Kassahun, Y., Sommer, G.: Efficient reinforcement learning through evolutionary acquisition of neural topologies. In: Verleysen, M. (ed.) 13th European Symposium on Artificial Neural Networks, pp. 259-266. d-side (2005)
- Kassahun, Y., Sommer, G.: Efficient reinforcement learning through evolutionary acquisition of neural topologies. In: Verleysen, M. (ed.) 13th European Symposium on Artificial Neural Networks, pp. 259-266. d-side (2005)

13
- 55749088183
- Natural evolution strategies
- IEEE Press, Los Alamitos accepted
- Wierstra, D., Schaul, T., Peters, J., Schmidhuber, J.: Natural evolution strategies. In: Computational Intelligence: Research Frontiers. IEEE Press, Los Alamitos (accepted, 2008)
- (2008) Computational Intelligence: Research Frontiers
- Wierstra, D.¹ Schaul, T.² Peters, J.³ Schmidhuber, J.⁴

14
- 0004102479
- MIT Press, Cambridge
- Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

15
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L., Littman, M., Cassandra, A.: Planning and acting in partially observable stochastic domains. Artificial Intelligence 101(1-2), 99-134 (1998)
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.¹ Littman, M.² Cassandra, A.³

16
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Sutton, R., McAllester, D., Singh, S., Mansour, Y.: Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems 12, 1057-1063 (2000)
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

17
- 0003502414
- Frommann-Holzboog
- Rechenberg, I.: Evolutionsstrategie: Optimierung Technischer Systeme nach Prinzipien der Biologischen Evolution. Frommann-Holzboog (1973)
- (1973) Evolutionsstrategie: Optimierung Technischer Systeme nach Prinzipien der Biologischen Evolution
- Rechenberg, I.¹

18
- 0003636650
- Evolution and Optimum Seeking
- John Wiley & Sons, Chichester
- Schwefel, H.P.: Evolution and Optimum Seeking. Sixth-Generation Computer Technology Series. John Wiley & Sons, Chichester (1995)
- (1995) Sixth-Generation Computer Technology Series
- Schwefel, H.P.¹

19
- 0037592480
- Evolution strategies: A comprehensive introduction
- Beyer, H.G., Schwefel, H.P.: Evolution strategies: A comprehensive introduction. Natural Computing 1(1), 3-52 (2002)
- (2002) Natural Computing , vol.1 , Issue.1 , pp. 3-52
- Beyer, H.G.¹ Schwefel, H.P.²

20
- 4344707250
- Learning probability distributions in continuous evolutionary algorithms - A comparative review
- Kern, S., Müller, S., Hansen, N., Büche, D., Ocenasek, J., Koumoutsakos, P.: Learning probability distributions in continuous evolutionary algorithms - A comparative review. Natural Computing 3, 77-112 (2004)
- (2004) Natural Computing , vol.3 , pp. 77-112
- Kern, S.¹ Müller, S.² Hansen, N.³ Büche, D.⁴ Ocenasek, J.⁵ Koumoutsakos, P.⁶

21
- 0042879997
- Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES)
- Hansen, N., Müller, S., Koumoutsakos, P.: Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES). Evolutionary Computation 11(1), 1-18 (2003)
- (2003) Evolutionary Computation , vol.11 , Issue.1 , pp. 1-18
- Hansen, N.¹ Müller, S.² Koumoutsakos, P.³

22
- 0035377566
- Completely derandomized self-adaptation in evolution strategies
- Hansen, N., Ostermeier, A.: Completely derandomized self-adaptation in evolution strategies. Evolutionary Computation 9(2), 159-195 (2001)
- (2001) Evolutionary Computation , vol.9 , Issue.2 , pp. 159-195
- Hansen, N.¹ Ostermeier, A.²

23
- 59749091473
- A method for handling uncertainty in evolutionary optimization with an application to feedback control of combustion
- in press
- Hansen, N., Niederberger, A.S.P., Guzzella, L., Koumoutsakos, P.: A method for handling uncertainty in evolutionary optimization with an application to feedback control of combustion. IEEE Transactions on Evolutionary Computation (in press, 2008)
- (2008) IEEE Transactions on Evolutionary Computation
- Hansen, N.¹ Niederberger, A.S.P.² Guzzella, L.³ Koumoutsakos, P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.