SCOPUS 정보 검색 플랫폼

Proceedings of the 11th Annual Genetic and Evolutionary Computation Conference, GECCO-2009

Volumn , Issue , 2009, Pages 405-412

EDA-RL: Estimation of distribution algorithms for reinforcement learning problems

(1) Handa, Hisashi a

a OKAYAMA UNIVERSITY (Japan)

Author keywords

Conditional random fields; Estimation of distribution algorithms; Reinforcement learning problems

Indexed keywords

CONDITIONAL PROBABILITIES; CONDITIONAL PROBABILITY DISTRIBUTIONS; CONDITIONAL RANDOM FIELD; CONVENTIONAL REINFORCEMENT LEARNING; ESTIMATION OF DISTRIBUTION ALGORITHMS; EVOLUTIONARY COMPUTATIONS; INPUT-OUTPUT DATA; MAZE PROBLEMS; PERCEPTUAL ALIASING; PROBABILISTIC DISTRIBUTION; PROBABILISTIC MODELS; TRANSITION PROBLEMS;

AUTONOMOUS AGENTS; CALCULATIONS; CONTENT BASED RETRIEVAL; EDUCATION; ESTIMATION; EVOLUTIONARY ALGORITHMS; IMAGE SEGMENTATION; PROBABILITY DISTRIBUTIONS; REINFORCEMENT; REINFORCEMENT LEARNING; SIMULATORS;

LEARNING ALGORITHMS;

EID: 72749094980 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1569901.1569958 Document Type: Conference Paper

Times cited : (18)

References (12)

1
- 33750249654
- Studying XCS/BOA learning in boolean functions: Structure encoding and random boolean functions
- M. V. Butz and M. Pelikan. Studying XCS/BOA learning in boolean functions: structure encoding and random boolean functions. In Proc. of the 2006 Genetic and Evol. Gomput. Conf., pages 1449-1456, 2006.
- (2006) Proc. of the 2006 Genetic and Evol. Gomput. Conf , pp. 1449-1456
- Butz, M.V.¹ Pelikan, M.²

2
- 55249089447
- Evolutionary fuzzy systems for generating better Ms.PacMan players
- H. Handa and M. Isozaki. Evolutionary fuzzy systems for generating better Ms.PacMan players. 2008 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'08), pages 2182-2185, 2008.
- (2008) 2008 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'08) , pp. 2182-2185
- Handa, H.¹ Isozaki, M.²

3
- 72749098743
- J. Lafferty. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of 18th International Conference on Machine Learning, pages 282-289. Morgan Kaufmarm, 2001.
- J. Lafferty. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of 18th International Conference on Machine Learning, pages 282-289. Morgan Kaufmarm, 2001.

4
- 0041481252
- Kluwer Academic Publishers, Norwell, MA, USA
- P. Larrañaga and J. A. Lozano. Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation. Kluwer Academic Publishers, Norwell, MA, USA, 2001.
- (2001) Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation
- Larrañaga, P.¹ Lozano, J.A.²

5
- 0033258156
- FDA -a scalable evolutionary algorithm for the optimization of additively decomposed functions
- H. Mühlenbein and T. Mahnig. FDA -a scalable evolutionary algorithm for the optimization of additively decomposed functions. Evol. Comput., 7(4):353-376, 1999.
- (1999) Evol. Comput , vol.7 , Issue.4 , pp. 353-376
- Mühlenbein, H.¹ Mahnig, T.²

6
- 72749105995
- A genetic algorithm for automatically designing modular reinforcement learning agents
- I. Ono, T. Nijo, and N. Ono. A genetic algorithm for automatically designing modular reinforcement learning agents. In Proc. of the Genetic and Evol. Comput. Conf., pages 203-210, 2000.
- (2000) Proc. of the Genetic and Evol. Comput. Conf , pp. 203-210
- Ono, I.¹ Nijo, T.² Ono, N.³

7
- 15544373328
- Estimation of distribution algorithms with Kikuchi approximations
- R. Santana. Estimation of distribution algorithms with Kikuchi approximations. Evol. Comput., 13(1):67-97, 2005.
- (2005) Evol. Comput , vol.13 , Issue.1 , pp. 67-97
- Santana, R.¹

8
- 27144449634
- Incorporating a metropolis method in a distribution estimation using markov random field algorithm
- S. K. Shakya, J. A. W. McCall, and D. F. Brown. Incorporating a metropolis method in a distribution estimation using markov random field algorithm. In Proc. of 2005 IEEE Congress on Evol. Comput., volume 3, pages 2576-2583, 2005.
- (2005) Proc. of 2005 IEEE Congress on Evol. Comput , vol.3 , pp. 2576-2583
- Shakya, S.K.¹ McCall, J.A.W.² Brown, D.F.³

9
- 32444439999
- Using a markov network model in a univariate EDA: An emperical cost-benefit analysis
- S. K. Shakya, J. A. W. McCall, and D. F. Brown. Using a markov network model in a univariate EDA: An emperical cost-benefit analysis. In Proc. of 2005 Genetic and Evol. Comput. Conf., pages 727-734, 2005.
- (2005) Proc. of 2005 Genetic and Evol. Comput. Conf , pp. 727-734
- Shakya, S.K.¹ McCall, J.A.W.² Brown, D.F.³

10
- 33750032384
- An introduction to conditional random fields for relational learning
- L. Getoor and B. Taskar, editors, chapter 4, MIT Press, Cambridge, MA
- C. Sutton and A. Mccallum. An introduction to conditional random fields for relational learning. In L. Getoor and B. Taskar, editors, Introduction to Statistical Relational Learning, chapter 4, pages 93-128. MIT Press, Cambridge, MA, 2007.
- (2007) Introduction to Statistical Relational Learning , pp. 93-128
- Sutton, C.¹ Mccallum, A.²

11
- 0004102479
- MIT Press, Cambridge, MA
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

12
- 34249833101
- C. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 08:279 292(14), May 1992.
- C. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 08:279 292(14), May 1992.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.