메뉴 건너뛰기




Volumn 6024 LNCS, Issue PART 1, 2010, Pages 201-210

Multiple overlapping tiles for contextual Monte Carlo tree search

Author keywords

[No Author keywords available]

Indexed keywords

TREES (MATHEMATICS);

EID: 77952337886     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-12239-2_21     Document Type: Conference Paper
Times cited : (12)

References (12)
  • 1
    • 33750293964 scopus 로고    scopus 로고
    • Bandit-based monte-carlo planning
    • Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. Springer, Heidelberg
    • Kocsis, L., Szepesvari, C.: Bandit-based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol.4212, pp. 282-293. Springer, Heidelberg (2006)
    • (2006) LNCS (LNAI) , vol.4212 , pp. 282-293
    • Kocsis, L.1    Szepesvari, C.2
  • 8
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S.: Learning to predict by the methods of temporal differences. Machine Learning, 9-44 (1988)
    • (1988) Machine Learning , pp. 9-44
    • Sutton, R.S.1
  • 9
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47(2/3), 235-256 (2002)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 10
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.1    Robbins, H.2
  • 11
    • 26944466214 scopus 로고    scopus 로고
    • Function approximation via tile coding: Automating parameter choice
    • Zucker, J.-D., Saitta, L. (eds.) SARA 2005. Springer, Heidelberg
    • Sherstov, E.A., Stone, P.: Function approximation via tile coding: Automating parameter choice. In: Zucker, J.-D., Saitta, L. (eds.) SARA 2005. LNCS (LNAI), vol.3607, pp. 194-205. Springer, Heidelberg (2005)
    • (2005) LNCS (LNAI) , vol.3607 , pp. 194-205
    • Sherstov, E.A.1    Stone, P.2
  • 12
    • 78951480078 scopus 로고    scopus 로고
    • Creating an Upper-Confidence-Tree program for Havannah
    • Pamplona Espagne
    • Teytaud, F., Teytaud, O.: Creating an Upper-Confidence-Tree program for Havannah. In: Advances in Computer Games 12, Pamplona Espagne (2009)
    • (2009) Advances in Computer Games , vol.12
    • Teytaud, F.1    Teytaud, O.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.