메뉴 건너뛰기




Volumn 6624 LNCS, Issue PART 1, 2011, Pages 153-162

Upper confidence trees with short term partial information

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; COMPUTERS;

EID: 79955805743     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-20525-5_16     Document Type: Conference Paper
Times cited : (26)

References (14)
  • 3
    • 77956556555 scopus 로고    scopus 로고
    • Multi-agent learning experiments on repeated matrix games
    • Bouzy, B., Métivier, M.: Multi-agent learning experiments on repeated matrix games. In: ICML, pp. 119-126 (2010)
    • (2010) ICML , pp. 119-126
    • Bouzy, B.1    Métivier, M.2
  • 4
    • 38049037928 scopus 로고    scopus 로고
    • Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search
    • Ciancarini, P., van den Herik, H.J. (eds.) CG 2006. Springer, Heidelberg
    • Coulom, R.: Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search. In: Ciancarini, P., van den Herik, H.J. (eds.) CG 2006. LNCS, vol. 4630, pp. 72-83. Springer, Heidelberg (2007)
    • (2007) LNCS , vol.4630 , pp. 72-83
    • Coulom, R.1
  • 5
    • 0006630130 scopus 로고
    • A sublinear-time randomized approximation algorithm for matrix games
    • Grigoriadis, M.D., Khachiyan, L.G.: A sublinear-time randomized approximation algorithm for matrix games. Operations Research Letters 18(2), 53-58 (1995)
    • (1995) Operations Research Letters , vol.18 , Issue.2 , pp. 53-58
    • Grigoriadis, M.D.1    Khachiyan, L.G.2
  • 7
    • 33750293964 scopus 로고    scopus 로고
    • Bandit based monte-carlo planning
    • Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. Springer, Heidelberg
    • Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282-293. Springer, Heidelberg (2006)
    • (2006) LNCS (LNAI) , vol.4212 , pp. 282-293
    • Kocsis, L.1    Szepesvári, C.2
  • 8
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.1    Robbins, H.2
  • 10
    • 0037840849 scopus 로고    scopus 로고
    • On the undecidability of probabilistic planning and related stochastic optimization problems
    • Madani, O., Hanks, S., Condon, A.: On the undecidability of probabilistic planning and related stochastic optimization problems. Artif. Intell. 147(1-2), 5-34 (2003)
    • (2003) Artif. Intell. , vol.147 , Issue.1-2 , pp. 5-34
    • Madani, O.1    Hanks, S.2    Condon, A.3
  • 11
    • 0001205548 scopus 로고    scopus 로고
    • Complexity of finite-horizon markov decision process problems
    • Mundhenk, M., Goldsmith, J., Lusena, C., Allender, E.: Complexity of finite-horizon markov decision process problems. J. ACM 47(4), 681-720 (2000)
    • (2000) J. ACM , vol.47 , Issue.4 , pp. 681-720
    • Mundhenk, M.1    Goldsmith, J.2    Lusena, C.3    Allender, E.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.