SCOPUS 정보 검색 플랫폼

Volumn 4212 LNAI, Issue , 2006, Pages 282-293

Bandit based Monte-Carlo planning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; BOUNDARY VALUE PROBLEMS; ERROR ANALYSIS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MONTE CARLO METHODS;

FINITE SAMPLE BOUNDS; MONTE-CARLO PLANNING; OPTIMAL SOLUTIONS;

MARKOV PROCESSES;

EID: 33750293964 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11871842_29 Document Type: Conference Paper

Times cited : (2564)

References (14)

1
- 0036568025
- Finite time analysis of the multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

2
- 0037709910
- The nonstochastic multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R.E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32:48-77, 2002.
- (2002) SIAM Journal on Computing , vol.32 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

3
- 0003915098
- Technical report 91-57, Computer Science Department, University of Massachusetts
- A.G. Barto, S.J. Bradtke, and S.P. Singh. Real-time learning and control using asynchronous dynamic programming. Technical report 91-57, Computer Science Department, University of Massachusetts, 1991.
- (1991) Real-time Learning and Control Using Asynchronous Dynamic Programming
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

4
- 0036149710
- The challenge of poker
- D. Billings, A. Davidson, J. Schaeffer, and D. Szafron. The challenge of poker. Artificial Intelligence, 134:201-240, 2002.
- (2002) Artificial Intelligence , vol.134 , pp. 201-240
- Billings, D.¹ Davidson, A.² Schaeffer, J.³ Szafron, D.⁴

6
- 14644444172
- An adaptive sampling algorithm for solving Markov decision processes
- H.S. Chang, M. Fu, J. Hu, and S.I. Marcus. An adaptive sampling algorithm for solving Markov decision processes. Operations Research, 53(1):126-139, 2005.
- (2005) Operations Research , vol.53 , Issue.1 , pp. 126-139
- Chang, H.S.¹ Fu, M.² Hu, J.³ Marcus, S.I.⁴

7
- 80053628578
- Monte Carlo planning in RTS games
- M. Chung, M. Buro, and J. Schaeffer. Monte Carlo planning in RTS games. In CIG 2005, Colchester, UK, 2005.
- (2005) CIG 2005, Colchester, UK
- Chung, M.¹ Buro, M.² Schaeffer, J.³

8
- 84880649215
- A sparse sampling algorithm for nearoptimal planning in large Markovian decisi on processes
- M. Kearns, Y. Mansour, and A.Y. Ng. A sparse sampling algorithm for nearoptimal planning in large Markovian decisi on processes. In Proceedings of IJCAI'99, pages 1324-1331, 1999.
- (1999) Proceedings of IJCAI'99 , pp. 1324-1331
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

9
- 0002899547
- Asymptotically efficient adaptive allocation rules
- T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

11
- 0036146034
- World-championship-caliber Scrabble
- B. Sheppard. World-championship-caliber Scrabble. Artificial Intelligence, 134(1-2):241-275, 2002.
- (2002) Artificial Intelligence , vol.134 , Issue.1-2 , pp. 241-275
- Sheppard, B.¹

12
- 0028604197
- An analysis of forward pruning
- S.J.J. Smith and D.S. Nau. An analysis of forward pruning. In AAAI, pages 1386-1391, 1994.
- (1994) AAAI , pp. 1386-1391
- Smith, S.J.J.¹ Nau, D.S.²

14
- 33750350789
- University of Princeton
- R. Vanderbei. Optimal sailing strategies, statistics and operations research program. University of Princeton, http://www.sor.princeton.edu/r~vdb/ sail/sail.html., 1996.
- (1996) Optimal Sailing Strategies, Statistics and Operations Research Program
- Vanderbei, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.