SCOPUS Information Search Platform

3rd International Symposium on Intelligent Information Technology Application, IITA 2009

Volume 2, 2009, Pages 125-128

Backpropagation Modification in Monte-Carlo Game Tree Search

(2) Xie, Fan (a); Liu, Zhiqing (a)

(a) Beijing University of Posts and Telecommunications (China)

Author keywords

Machine learning; Monte Carlo tree search; Weight factor

Indexed keywords

EXPONENTIAL MODELS; GAME TREE SEARCH; MACHINE-LEARNING; MONTE CARLO; MONTE CARLO SIMULATION; MULTI-ARMED BANDIT PROBLEM; SEARCH SPACES; TREE SEARCH; WEIGHT FACTOR; INFORMATION TECHNOLOGY; LEARNING SYSTEMS

EID: 77649307992 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IITA.2009.331 Document Type: Conference Paper

Times cited: 15

References (11)

1
- 84880649215
- A sparse sampling algorithm for near optimal planning in large Markovian decision processes
- M. Kearns, Y. Mansour, and A.Y. Ng: A sparse sampling algorithm for near optimal planning in large Markovian decision processes. In Proceedings of IJCAI'99, pages 1324-1331, 1999.
- (1999) Proceedings of IJCAI'99 , pp. 1324-1331
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

2
- 77649304247
- Gelly, S., Wang, Y.: Exploration exploitation in Go: UCT for Monte-Carlo Go. In: NIPS-2006: On-line Trading of Exploration and Exploitation Workshop, Whistler, Canada (2006)

3
- 33750293964
- Bandit Based Monte-Carlo Planning
- Kocsis, L., Szepesvari, C.: Bandit Based Monte-Carlo Planning. In: 15th European Conference on Machine Learning (ECML), pages 282-293, 2006
- (2006) 15th European Conference on Machine Learning (ECML) , pp. 282-293
- Kocsis, L.¹ Szepesvari, C.²

4
- 0003915098
- Real-time learning and control using asynchronous dynamic programming
- Technical report 91-57, Computer Science Department, University of Massachusetts
- A.G. Barto, S.J. Bradtke, and S.P. Singh. Real-time learning and control using asynchronous dynamic programming. Technical report 91-57, Computer Science Department, University of Massachusetts, 1991.
- (1991)
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

5
- 0036149710
- The challenge of poker
- D. Billings, A. Davidson, J. Schaeffer, and D. Szafron. The challenge of poker. Artificial Intelligence, 134:201-240, 2002.
- (2002) Artificial Intelligence , vol.134 , pp. 201-240
- Billings, D.¹ Davidson, A.² Schaeffer, J.³ Szafron, D.⁴

6
- 0036568025
- Finite time analysis of the multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

7
- 34547990649
- Combining Online and Offline Knowledge in UCT
- Gelly, S., Silver, D.: Combining Online and Offline Knowledge in UCT. In: Ghahramani, Z. (ed.) International Conference on Machine Learning (ICML 2007), pp. 273-280 (2007)
- (2007) International Conference on Machine Learning (ICML 2007) , pp. 273-280
- Gelly, S.¹ Silver, D.²

8
- 55249127519
- Progressive strategies for Monte-Carlo Tree Search
- H.J. van den Herik, and B. Bouzy. Progressive strategies for Monte-Carlo Tree Search. New Mathematics and Natural Computation, 4(3), 2008.
- (2008) New Mathematics and Natural Computation , vol.4 , Issue.3
- van den Herik, H.J.¹ Bouzy, B.²

9
- 84898992015
- G. Tesauro and G.R. Galperin. On-line policy improvement using Monte-Carlo search. In M.C. Mozer, M.I. Jordan, and T. Petsche, editors, NIPS 9, pages 1068-1074, 1997.

10
- 0036146034
- World-championship-caliber Scrabble
- B. Sheppard. World-championship-caliber Scrabble. Artificial Intelligence, 134(1-2):241-275, 2002.
- (2002) Artificial Intelligence , vol.134 , Issue.1-2 , pp. 241-275
- Sheppard, B.¹

11
- 0036149616
- Computer Go
- M. Müller, Computer Go, Artificial Intelligence, 134, 2002, pp. 145-179.
- (2002) Artificial Intelligence , vol.134 , pp. 145-179
- Müller, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.