Volume 4212 LNAI, 2006, Pages 282-293

Bandit based Monte-Carlo planning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; BOUNDARY VALUE PROBLEMS; ERROR ANALYSIS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MONTE CARLO METHODS;

EID: 33750293964     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11871842_29     Document Type: Conference Paper
Times cited: 2564

References (14)
  • 1
    • 0036568025
    • Finite time analysis of the multiarmed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 5
    • 24944458186
    • Monte Carlo Go developments
    • H.J. van den Herik, H. Iida, and E.A. Heinz, editors
    • B. Bouzy and B. Helmstetter. Monte Carlo Go developments. In H.J. van den Herik, H. Iida, and E.A. Heinz, editors, Advances in Computer Games 10, pages 159-174, 2004.
    • (2004) Advances in Computer Games , vol.10 , pp. 159-174
    • Bouzy, B.1    Helmstetter, B.2
  • 6
    • 14644444172
    • An adaptive sampling algorithm for solving Markov decision processes
    • H.S. Chang, M. Fu, J. Hu, and S.I. Marcus. An adaptive sampling algorithm for solving Markov decision processes. Operations Research, 53(1):126-139, 2005.
    • (2005) Operations Research , vol.53 , Issue.1 , pp. 126-139
    • Chang, H.S.1    Fu, M.2    Hu, J.3    Marcus, S.I.4
  • 8
    • 84880649215
    • A sparse sampling algorithm for near-optimal planning in large Markovian decision processes
    • M. Kearns, Y. Mansour, and A.Y. Ng. A sparse sampling algorithm for near-optimal planning in large Markovian decision processes. In Proceedings of IJCAI'99, pages 1324-1331, 1999.
    • (1999) Proceedings of IJCAI'99 , pp. 1324-1331
    • Kearns, M.1    Mansour, Y.2    Ng, A.Y.3
  • 9
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 10
    • 85017333287
    • On-line search for solving Markov decision processes via heuristic sampling
    • R.L. de Mántaras and L. Saitta, editors
    • L. Péret and F. Garcia. On-line search for solving Markov decision processes via heuristic sampling. In R.L. de Mántaras and L. Saitta, editors, ECAI, pages 530-534, 2004.
    • (2004) ECAI , pp. 530-534
    • Péret, L.1    Garcia, F.2
  • 11
    • 0036146034
    • World-championship-caliber Scrabble
    • B. Sheppard. World-championship-caliber Scrabble. Artificial Intelligence, 134(1-2):241-275, 2002.
    • (2002) Artificial Intelligence , vol.134 , Issue.1-2 , pp. 241-275
    • Sheppard, B.1
  • 12
    • 0028604197
    • An analysis of forward pruning
    • S.J.J. Smith and D.S. Nau. An analysis of forward pruning. In AAAI, pages 1386-1391, 1994.
    • (1994) AAAI , pp. 1386-1391
    • Smith, S.J.J.1    Nau, D.S.2
  • 13
    • 84898992015
    • On-line policy improvement using Monte-Carlo search
    • M.C. Mozer, M.I. Jordan, and T. Petsche, editors
    • G. Tesauro and G.R. Galperin. On-line policy improvement using Monte-Carlo search. In M.C. Mozer, M.I. Jordan, and T. Petsche, editors, NIPS 9, pages 1068-1074, 1997.
    • (1997) NIPS , vol.9 , pp. 1068-1074
    • Tesauro, G.1    Galperin, G.R.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.