메뉴 건너뛰기




Volumn 5323 LNAI, Issue , 2008, Pages 151-164

Optimistic planning of deterministic dsstems

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT; REINFORCEMENT LEARNING;

EID: 58449106591     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-89722-4_12     Document Type: Conference Paper
Times cited : (78)

References (11)
  • 1
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning Journal 47(2-3), 235-256 (2002)
    • (2002) Machine Learning Journal , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 4
    • 34250659969 scopus 로고    scopus 로고
    • Modification of UCT with patterns in Monte-Carlo go
    • Technical Report INRIA RR-6062
    • Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with patterns in Monte-Carlo go. Technical Report INRIA RR-6062 (2006)
    • (2006)
    • Gelly, S.1    Wang, Y.2    Munos, R.3    Teytaud, O.4
  • 5
    • 0036832951 scopus 로고    scopus 로고
    • A sparse sampling algorithm for near-optimal planning in large Markovian decision processes
    • Kearns, M., Mansour, Y., Ng, A.Y.: A sparse sampling algorithm for near-optimal planning in large Markovian decision processes. Machine Learning 49, 193-208 (2002)
    • (2002) Machine Learning , vol.49 , pp. 193-208
    • Kearns, M.1    Mansour, Y.2    Ng, A.Y.3
  • 7
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T.L., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 10
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • Robbins, H.: Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society 58, 527-535 (1952)
    • (1952) Bulletin of the American Mathematics Society , vol.58 , pp. 527-535
    • Robbins, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.