SCOPUS Information Search Platform

IJCAI International Joint Conference on Artificial Intelligence

2011, Pages 649-654

Nested rollout policy adaptation for Monte Carlo tree search

(1) Rosin, Christopher D. (a)

(a) Parity Computing Inc (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DETERMINISTIC OPTIMIZATION PROBLEMS; DOMAIN-SPECIFIC POLICIES; MONTE CARLO TREE SEARCH (MCTS); MONTE-CARLO TREE SEARCHES; NESTED MONTE-CARLO; NESTED ROLLOUT POLICY ADAPTATIONS; RANDOMIZED DECISIONS; SEARCH EFFICIENCY;

ARTIFICIAL INTELLIGENCE; OPTIMIZATION;

FORESTRY;

FORESTRY; OPTIMIZATION; PROBLEM SOLVING;

EID: 84881043243
PISSN: 10450823
EISSN: None
Source Type: Conference Proceeding
DOI: 10.5591/978-1-57735-516-8/IJCAI11-115
Document Type: Conference Paper

Times cited: 119

References (30)

1
- 79951744230
- Nested Monte-Carlo search with AMAF heuristic
- H. Akiyama et al. Nested Monte-Carlo search with AMAF heuristic. In TAAI, 2010.
- (2010) TAAI
- Akiyama, H.¹

2
- 84881056863
- Differential training of roll-out policies
- D. Bertsekas. Differential training of roll-out policies. In Allerton Conf., 1997.
- (1997) Allerton Conf.
- Bertsekas, D.¹

3
- 37149054260
- Searching solitaire in real time
- R. Bjarnason et al. Searching solitaire in real time. ICGA J., 2007.
- (2007) ICGA J.
- Bjarnason, R.¹

4
- 84881050740
- C. Boyer. Science & Vie, page 144, Nov. 2010.
- (2010) Science & Vie , pp. 144
- Boyer, C.¹

5
- 84881045464
- C. Boyer. http://morpionsolitaire.com, 2011.
- (2011)
- Boyer, C.¹

6
- 84881062330
- C.-H. Bruneau. Science & Vie, April 1976.
- (1976) Science & Vie
- Bruneau, C.-H.¹

7
- 70450082828
- Reflexive Monte-Carlo search
- T. Cazenave. Reflexive Monte-Carlo search. In CGW, 2007.
- (2007) CGW
- Cazenave, T.¹

8
- 72449151723
- Nested Monte-Carlo search
- T. Cazenave. Nested Monte-Carlo search. In IJCAI, 2009.
- (2009) IJCAI
- Cazenave, T.¹

9
- 84881060951
- Monte-Carlo bus regulation
- T. Cazenave et al. Monte-Carlo bus regulation. In ITSC, 2009.
- (2009) ITSC
- Cazenave, T.¹

10
- 38849139064
- Computing Elo ratings of move patterns in the game of Go
- R. Coulom. Computing Elo ratings of move patterns in the game of Go. In CGW, 2007.
- (2007) CGW
- Coulom, R.¹

11
- 84881077997
- GAMES Magazine, page 76, August 2006.
- (2006) GAMES Magazine , pp. 76

12
- 84881079209
- Winning solution: page 93, December 2006.
- (2006) Winning Solution , pp. 93

13
- 84881078906
- GAMES Magazine, page 8, June 1994.
- (1994) GAMES Magazine , pp. 8

14
- 84881038424
- Winning solution: page 67, October 1994.
- (1994) Winning Solution , pp. 67

15
- 84881075124
- Bandit-based optimization for library performance tuning
- F. de Mesmay et al. Bandit-based optimization for library performance tuning. ICML, 2009.
- (2009) ICML
- De Mesmay, F.¹

16
- 33646469243
- Morpion Solitaire
- E. D. Demaine et al. Morpion Solitaire. Theory Comput. Syst., 2006.
- (2006) Theory Comput. Syst.
- Demaine, E.D.¹

17
- 58349113822
- Approximate policy iteration with a policy language bias
- A. Fern et al. Approximate policy iteration with a policy language bias. In NIPS, 2003.
- (2003) NIPS
- Fern, A.¹

18
- 84893360936
- Learning simulation control in GGP agents
- H. Finnsson et al. Learning simulation control in GGP agents. In AAAI, 2010.
- (2010) AAAI
- Finnsson, H.¹

19
- 0004127488
- M. Garey and D. Johnson. Computers and Intractability. 1979.
- (1979) Computers and Intractability
- Garey, M.¹ Johnson, D.²

20
- 77958578450
- Combining online and offline knowledge in UCT
- S. Gelly et al. Combining online and offline knowledge in UCT. In ICML, 2007.
- (2007) ICML
- Gelly, S.¹

21
- 84886020812
- Search lessons learned from crossword puzzles
- M. Ginsberg et al. Search lessons learned from crossword puzzles. In AAAI, 1990.
- (1990) AAAI
- Ginsberg, M.¹

22
- 0041481252
- P. Larranaga et al. Estimation of Distribution Algorithms. Kluwer, 2002.
- (2002) Estimation of Distribution Algorithms
- Larranaga, P.¹

23
- 84881068270
- Combining UCT and NMCS for single-player GGP
- J. Méhat et al. Combining UCT and NMCS for single-player GGP. IEEE TCIAIG, 2010.
- (2010) IEEE TCIAIG
- Méhat, J.¹

24
- 78650622800
- Monte-Carlo exploration for deterministic planning
- H. Nakhost et al. Monte-Carlo exploration for deterministic planning. IJCAI, 2009.
- (2009) IJCAI
- Nakhost, H.¹

25
- 84881052253
- Optimization of the Nested Monte-Carlo Algorithm on the Traveling Salesman Problem with Time Windows
- A. Rimmel et al. Optimization of the Nested Monte-Carlo Algorithm on the Traveling Salesman Problem with Time Windows. In Evostar, 2011.
- (2011) Evostar
- Rimmel, A.¹

26
- 79951729216
- UCD: Upper confidence bound for directed acyclic graphs
- A. Saffidine et al. UCD: Upper confidence bound for directed acyclic graphs. In TAAI, 2010.
- (2010) TAAI
- Saffidine, A.¹

27
- 71149102015
- Monte-Carlo simulation balancing
- D. Silver et al. Monte-Carlo simulation balancing. In ICML, 2009.
- (2009) ICML
- Silver, D.¹

28
- 85161963598
- Monte-Carlo planning in large POMDPs
- D. Silver et al. Monte-Carlo planning in large POMDPs. In NIPS, 2010.
- (2010) NIPS
- Silver, D.¹

29
- 0001332415
- On-line policy improvement using Monte-Carlo search
- G. Tesauro et al. On-line policy improvement using Monte-Carlo search. In NIPS, 1996.
- (1996) NIPS
- Tesauro, G.¹

30
- 79956344726
- A Monte-Carlo AIXI approximation
- J. Veness et al. A Monte-Carlo AIXI approximation. JAIR, 2011.
- (2011) JAIR
- Veness, J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.