Volume 5323 LNAI, 2008, Pages 136-150

Variable metric reinforcement learning methods applied to the noisy mountain car problem

Author keywords

[No Author keywords available]

Indexed keywords

COVARIANCE MATRIX; EDUCATION; EVOLUTIONARY ALGORITHMS; LANDFORMS; LEARNING ALGORITHMS; REINFORCEMENT LEARNING

EID: 58449122813     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-89722-4_11     Document Type: Conference Paper
Times cited: 19

References (23)
  • 1
    • 84886993021
    • Heidrich-Meisner, V., Igel, C.: Similarities and differences between policy gradient methods and evolution strategies. In: Verleysen, M. (ed.) 16th European Symposium on Artificial Neural Networks (ESANN), Evere, Belgium, pp. 149-154. d-side publications (2008)
  • 4
    • 84886998125
    • Peters, J., Schaal, S.: Applying the episodic natural actor-critic architecture to motor primitive learning. In: Proc. 15th European Symposium on Artificial Neural Networks (ESANN 2007), Evere, Belgium, pp. 1-6. d-side publications (2007)
  • 5
    • 40649106649
    • Peters, J., Schaal, S.: Natural actor-critic. Neurocomputing 71(7-9), 1180-1190 (2008)
  • 6
    • 33845271655
    • Hansen, N.: The CMA evolution strategy: A comparing review. In: Towards a new evolutionary computation. Advances on estimation of distribution algorithms, pp. 75-102. Springer, Heidelberg (2006)
  • 7
    • 56449128627
    • Beyer, H.G.: Evolution strategies. Scholarpedia 2(18), 1965 (2007)
  • 8
    • 84901411269
    • Igel, C.: Neuroevolution for reinforcement learning using evolution strategies. In: Congress on Evolutionary Computation (CEC 2003), vol. 4, pp. 2588-2595. IEEE Press, Los Alamitos (2003)
  • 10
    • 33750374195
    • Gomez, F., Schmidhuber, J., Miikkulainen, R.: Efficient non-linear control through neuroevolution. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS, vol. 4212, pp. 654-662. Springer, Heidelberg (2006)
  • 12
    • 84887011679
    • Kassahun, Y., Sommer, G.: Efficient reinforcement learning through evolutionary acquisition of neural topologies. In: Verleysen, M. (ed.) 13th European Symposium on Artificial Neural Networks, pp. 259-266. d-side (2005)
  • 15
    • 0032073263
    • Kaelbling, L., Littman, M., Cassandra, A.: Planning and acting in partially observable stochastic domains. Artificial Intelligence 101(1-2), 99-134 (1998)
  • 19
    • 0037592480
    • Beyer, H.G., Schwefel, H.P.: Evolution strategies: A comprehensive introduction. Natural Computing 1(1), 3-52 (2002)
  • 20
    • 4344707250
    • Kern, S., Müller, S., Hansen, N., Büche, D., Ocenasek, J., Koumoutsakos, P.: Learning probability distributions in continuous evolutionary algorithms - A comparative review. Natural Computing 3, 77-112 (2004)
  • 21
    • 0042879997
    • Hansen, N., Müller, S., Koumoutsakos, P.: Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES). Evolutionary Computation 11(1), 1-18 (2003)
  • 22
    • 0035377566
    • Hansen, N., Ostermeier, A.: Completely derandomized self-adaptation in evolution strategies. Evolutionary Computation 9(2), 159-195 (2001)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.