메뉴 건너뛰기




Volumn , Issue , 2011, Pages 649-654

Nested rollout policy adaptation for Monte Carlo tree search

Author keywords

[No Author keywords available]

Indexed keywords

DETERMINISTIC OPTIMIZATION PROBLEMS; DOMAIN-SPECIFIC POLICIES; MONTE CARLO TREE SEARCH (MCTS); MONTE-CARLO TREE SEARCHES; NESTED MONTE-CARLO; NESTED ROLLOUT POLICY ADAPTATIONS; RANDOMIZED DECISIONS; SEARCH EFFICIENCY;

EID: 84881043243     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5591/978-1-57735-516-8/IJCAI11-115     Document Type: Conference Paper
Times cited : (119)

References (30)
  • 1
    • 79951744230 scopus 로고    scopus 로고
    • Nested MonteCarlo search with AMAF heuristic
    • H. Akiyama et al. Nested MonteCarlo search with AMAF heuristic. In TAAI, 2010.
    • (2010) TAAI
    • Akiyama, H.1
  • 2
    • 84881056863 scopus 로고    scopus 로고
    • Differential training of roll-out policies
    • D. Bertsekas. Differential training of roll-out policies. In Allerton Conf., 1997.
    • Allerton Conf., 1997
    • Bertsekas, D.1
  • 3
    • 37149054260 scopus 로고    scopus 로고
    • Searching solitaire in real time
    • R. Bjarnason et al. Searching solitaire in real time. ICGA J., 2007.
    • (2007) ICGA J.
    • Bjarnason, R.1
  • 5
    • 84881045464 scopus 로고    scopus 로고
    • C. Boyer. http://morpionsolitaire.com, 2011.
    • (2011)
    • Boyer, C.1
  • 7
    • 70450082828 scopus 로고    scopus 로고
    • Reflexive Monte-Carlo search
    • T. Cazenave. Reflexive Monte-Carlo search. In CGW, 2007.
    • (2007) CGW
    • Cazenave, T.1
  • 8
    • 72449151723 scopus 로고    scopus 로고
    • Nested Monte-Carlo search
    • T. Cazenave. Nested Monte-Carlo search. In IJCAI, 2009.
    • (2009) IJCAI
    • Cazenave, T.1
  • 9
    • 84881060951 scopus 로고    scopus 로고
    • Monte-Carlo bus regulation
    • T. Cazenave et al. Monte-Carlo bus regulation. In ITSC, 2009.
    • (2009) ITSC
    • Cazenave, T.1
  • 10
    • 38849139064 scopus 로고    scopus 로고
    • Computing Elo ratings of move patterns in the game of Go
    • R. Coulom. Computing Elo ratings of move patterns in the game of Go. In CGW, 2007.
    • (2007) CGW
    • Coulom, R.1
  • 11
    • 84881077997 scopus 로고    scopus 로고
    • August
    • GAMES Magazine, page 76, August 2006.
    • (2006) GAMES Magazine , pp. 76
  • 12
    • 84881079209 scopus 로고    scopus 로고
    • December
    • Winning solution: page 93, December 2006.
    • (2006) Winning Solution , pp. 93
  • 13
    • 84881078906 scopus 로고
    • June
    • GAMES Magazine, page 8, June 1994.
    • (1994) GAMES Magazine , pp. 8
  • 14
    • 84881038424 scopus 로고
    • October
    • Winning solution: page 67, October 1994.
    • (1994) Winning Solution , pp. 67
  • 15
    • 84881075124 scopus 로고    scopus 로고
    • Bandit-based optimization for library performance tuning
    • F. de Mesmay et al. Bandit-based optimization for library performance tuning. ICML, 2009.
    • (2009) ICML
    • De Mesmay, F.1
  • 17
    • 58349113822 scopus 로고    scopus 로고
    • Approximate policy iteration with a policy language bias
    • A. Fern et al. Approximate policy iteration with a policy language bias. In NIPS, 2003.
    • (2003) NIPS
    • Fern, A.1
  • 18
    • 84893360936 scopus 로고    scopus 로고
    • Learning simulation control in GGP agents
    • H. Finnsson et al. Learning simulation control in GGP agents. In AAAI, 2010.
    • (2010) AAAI
    • Finnsson, H.1
  • 20
    • 77958578450 scopus 로고    scopus 로고
    • Combining online and offline knowledge in UCT
    • S. Gelly et al. Combining online and offline knowledge in UCT. In ICML, 2007.
    • (2007) ICML
    • Gelly, S.1
  • 21
    • 84886020812 scopus 로고
    • Search lessons learned from crossword puzzles
    • M. Ginsberg et al. Search lessons learned from crossword puzzles. In AAAI, 1990.
    • (1990) AAAI
    • Ginsberg, M.1
  • 23
    • 84881068270 scopus 로고    scopus 로고
    • Combining UCT and NMCS for single-player GGP
    • J. Méhat et al. Combining UCT and NMCS for single-player GGP. IEEE TCIAIG, 2010.
    • (2010) IEEE TCIAIG
    • Méhat, J.1
  • 24
    • 78650622800 scopus 로고    scopus 로고
    • Monte-Carlo exploration for deterministic planning
    • H. Nakhost et al. Monte-Carlo exploration for deterministic planning. IJCAI, 2009.
    • (2009) IJCAI
    • Nakhost, H.1
  • 25
    • 84881052253 scopus 로고    scopus 로고
    • Optimization of the Nested Monte-Carlo Algorithm on the Traveling Salesman Problem with Time Windows
    • A. Rimmel et al. Optimization of the Nested Monte-Carlo Algorithm on the Traveling Salesman Problem with Time Windows. In Evostar, 2011.
    • (2011) Evostar
    • Rimmel, A.1
  • 26
    • 79951729216 scopus 로고    scopus 로고
    • UCD: Upper confidence bound for directed acyclic graphs
    • A. Saffidine et al. UCD: Upper confidence bound for directed acyclic graphs. In TAAI, 2010.
    • (2010) TAAI
    • Saffidine, A.1
  • 27
    • 71149102015 scopus 로고    scopus 로고
    • Monte-Carlo simulation balancing
    • D. Silver et al. Monte-Carlo simulation balancing. In ICML, 2009.
    • (2009) ICML
    • Silver, D.1
  • 28
    • 85161963598 scopus 로고    scopus 로고
    • Monte-Carlo planning in large POMDPs
    • D. Silver et al. Monte-Carlo planning in large POMDPs. In NIPS, 2010.
    • (2010) NIPS
    • Silver, D.1
  • 29
    • 0001332415 scopus 로고    scopus 로고
    • On-line policy improvement using Monte-Carlo search
    • G. Tesauro et al. On-line policy improvement using Monte-Carlo search. In NIPS, 1996.
    • (1996) NIPS
    • Tesauro, G.1
  • 30
    • 79956344726 scopus 로고    scopus 로고
    • A Monte-Carlo AIXI approximation
    • J. Veness et al. A Monte-Carlo AIXI approximation. JAIR, 2011.
    • (2011) JAIR
    • Veness, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.