SCOPUS 정보 검색 플랫폼

Volumn 1, Issue , 2002, Pages 151-156

Step size adaptation in evolution strategies using reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING;

ADAPTATION PARAMETERS; ADAPTIVE SCHEME; EVOLUTION STRATEGIES; STEP SIZE; STEP-SIZE ADAPTATIONS; TEST CASE; TEST FUNCTIONS;

EVOLUTIONARY ALGORITHMS;

EID: 36348992992 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CEC.2002.1006225 Document Type: Conference Paper

Times cited : (39)

References (11)

1
- 0003502414
- Fromann-Holzboog, Stuttgart
- Rechenberg, I., "Evolutionsstrategie: Optimierung technischer Systeme nach Prinzipien der biolo-gischen Evolution," Fromann-Holzboog, Stuttgart, 1973.
- (1973) Evolutionsstrategie: Optimierung Technischer Systeme Nach Prinzipien der Biolo-gischen Evolution
- Rechenberg, I.¹

2
- 0003827650
- Fromann-Holzboog, Stuttgart
- Rechenberg, I., "Evolutionsstrategie '94," Fromann-Holzboog, Stuttgart, 1994.
- (1994) Evolutionsstrategie '94
- Rechenberg, I.¹

3
- 0003435075
- Oxford University Press
- Back, Th.: "Evolutionary Algorithms in Theory and Practice," Oxford University Press, 1996.
- (1996) Evolutionary Algorithms in Theory and Practice
- Th, B.¹

4
- 0029722015
- Adapting arbitrary normal mutation distributions in evolution strategies: The covariance matrix adaptation
- Hansen, N., Ostermeier, A., "Adapting Arbitrary Normal Mutation Distributions in Evolution Strategies: The Covariance Matrix Adaptation," Proceedings of the IEEE International Conference on Evolutionary Computation (ICEC'96),pp. 312-317, 1996.
- (1996) Proceedings of the IEEE International Conference on Evolutionary Computation (ICEC'96) , pp. 312-317
- Hansen, N.¹ Ostermeier, A.²

5
- 0002640882
- i,λ)-cma-es
- Hansen, N., Ostermeier, A., "Convergence Properties of Evolution Strategies with the Derandomized Co-variance Matrix Adaptation: The (μ/ μi,Λ)-CMA-ES," Proceedings of the 5th European Congress on Intelligent Techniques and Soft Computing (EU-FIT'97),pp. 650-654, 1997.
- (1997) Proceedings of the 5th European Congress on Intelligent Techniques and Soft Computing (EU-FIT'97) , pp. 650-654
- Hansen, N.¹ Ostermeier, A.²

7
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G., "Temporal difference learning and TD-Gammon," Communications of the ACM, 38(3), pp.58-68, 1995.
- (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

8
- 0004102479
- MIT Press, Cambridge
- Sutton, R.S., Barto, A.G., "Reinforcement Learning-An Introduction," MIT Press, Cambridge, 1998.
- (1998) Reinforcement Learning-An Introduction
- Sutton, R.S.¹ Barto, A.G.²

9
- 0003636650
- John Wiley and Sons, New York
- Schwefel, H.-P., "Evolution and Optimum Seeking," John Wiley and Sons, New York, 1995.
- (1995) Evolution and Optimum Seeking
- Schwefel, H.-P.¹

10
- 20444380868
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Singh, S., Jaakkola, T., Littman, M.L., Szpes-vari, C, "Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms," Machine Learning, 1999.
- (1999) Machine Learning
- Singh, S.¹ Jaakkola, T.² Littman, M.L.³ Szpesvari, C.⁴

11
- 0004255908
- McGraw-Hill
- Mitchell, T.M., "Machine Learning," McGraw-Hill, 1997.
- (1997) Machine Learning
- Mitchell, T.M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.