메뉴 건너뛰기




Volumn , Issue , 2009, Pages 405-412

EDA-RL: Estimation of distribution algorithms for reinforcement learning problems

Author keywords

Conditional random fields; Estimation of distribution algorithms; Reinforcement learning problems

Indexed keywords

CONDITIONAL PROBABILITIES; CONDITIONAL PROBABILITY DISTRIBUTIONS; CONDITIONAL RANDOM FIELD; CONVENTIONAL REINFORCEMENT LEARNING; ESTIMATION OF DISTRIBUTION ALGORITHMS; EVOLUTIONARY COMPUTATIONS; INPUT-OUTPUT DATA; MAZE PROBLEMS; PERCEPTUAL ALIASING; PROBABILISTIC DISTRIBUTION; PROBABILISTIC MODELS; TRANSITION PROBLEMS;

EID: 72749094980     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1569901.1569958     Document Type: Conference Paper
Times cited : (18)

References (12)
  • 1
    • 33750249654 scopus 로고    scopus 로고
    • Studying XCS/BOA learning in boolean functions: Structure encoding and random boolean functions
    • M. V. Butz and M. Pelikan. Studying XCS/BOA learning in boolean functions: structure encoding and random boolean functions. In Proc. of the 2006 Genetic and Evol. Gomput. Conf., pages 1449-1456, 2006.
    • (2006) Proc. of the 2006 Genetic and Evol. Gomput. Conf , pp. 1449-1456
    • Butz, M.V.1    Pelikan, M.2
  • 3
    • 72749098743 scopus 로고    scopus 로고
    • J. Lafferty. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of 18th International Conference on Machine Learning, pages 282-289. Morgan Kaufmarm, 2001.
    • J. Lafferty. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of 18th International Conference on Machine Learning, pages 282-289. Morgan Kaufmarm, 2001.
  • 5
    • 0033258156 scopus 로고    scopus 로고
    • FDA -a scalable evolutionary algorithm for the optimization of additively decomposed functions
    • H. Mühlenbein and T. Mahnig. FDA -a scalable evolutionary algorithm for the optimization of additively decomposed functions. Evol. Comput., 7(4):353-376, 1999.
    • (1999) Evol. Comput , vol.7 , Issue.4 , pp. 353-376
    • Mühlenbein, H.1    Mahnig, T.2
  • 6
    • 72749105995 scopus 로고    scopus 로고
    • A genetic algorithm for automatically designing modular reinforcement learning agents
    • I. Ono, T. Nijo, and N. Ono. A genetic algorithm for automatically designing modular reinforcement learning agents. In Proc. of the Genetic and Evol. Comput. Conf., pages 203-210, 2000.
    • (2000) Proc. of the Genetic and Evol. Comput. Conf , pp. 203-210
    • Ono, I.1    Nijo, T.2    Ono, N.3
  • 7
    • 15544373328 scopus 로고    scopus 로고
    • Estimation of distribution algorithms with Kikuchi approximations
    • R. Santana. Estimation of distribution algorithms with Kikuchi approximations. Evol. Comput., 13(1):67-97, 2005.
    • (2005) Evol. Comput , vol.13 , Issue.1 , pp. 67-97
    • Santana, R.1
  • 8
    • 27144449634 scopus 로고    scopus 로고
    • Incorporating a metropolis method in a distribution estimation using markov random field algorithm
    • S. K. Shakya, J. A. W. McCall, and D. F. Brown. Incorporating a metropolis method in a distribution estimation using markov random field algorithm. In Proc. of 2005 IEEE Congress on Evol. Comput., volume 3, pages 2576-2583, 2005.
    • (2005) Proc. of 2005 IEEE Congress on Evol. Comput , vol.3 , pp. 2576-2583
    • Shakya, S.K.1    McCall, J.A.W.2    Brown, D.F.3
  • 9
  • 10
    • 33750032384 scopus 로고    scopus 로고
    • An introduction to conditional random fields for relational learning
    • L. Getoor and B. Taskar, editors, chapter 4, MIT Press, Cambridge, MA
    • C. Sutton and A. Mccallum. An introduction to conditional random fields for relational learning. In L. Getoor and B. Taskar, editors, Introduction to Statistical Relational Learning, chapter 4, pages 93-128. MIT Press, Cambridge, MA, 2007.
    • (2007) Introduction to Statistical Relational Learning , pp. 93-128
    • Sutton, C.1    Mccallum, A.2
  • 12
    • 34249833101 scopus 로고    scopus 로고
    • C. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 08:279 292(14), May 1992.
    • C. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 08:279 292(14), May 1992.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.