SCOPUS 정보 검색 플랫폼

Volumn 227, Issue , 2007, Pages 273-280

Combining online and offline knowledge in UCT

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; FUNCTIONAL ANALYSIS; MONTE CARLO METHODS; ONLINE SYSTEMS; RANDOM PROCESSES;

OFFLINE VALUE FUNCTION; SIMULATION POLICY; VALUE FUNCTIONS;

LEARNING ALGORITHMS;

EID: 34547990649 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1273496.1273531 Document Type: Conference Paper

Times cited : (458)

References (16)

1
- 0036568025
- Finite-time analysis of the multi-armed bandit problem
- Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47, 235-256.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

2
- 0004870198
- Experiments in parameter learning using temporal differences
- Baxter, J., Tridgell, A., & Weaver, L. (1998). Experiments in parameter learning using temporal differences. International Computer Chess Association Journal, 21, 84-99.
- (1998) International Computer Chess Association Journal , vol.21 , pp. 84-99
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

3
- 0003640133
- Bruegmann, B. (1993). Monte-Carlo Go. http://www.cgl.ucsf.edu/go/ Programs/Gobble.html.
- (1993) Monte-Carlo Go
- Bruegmann, B.¹

4
- 84956863737
- From simple features to sophisticated evaluation functions
- Buro, M. (1999). From simple features to sophisticated evaluation functions. 1st International Conference on Computers and Games (pp. 126-145).
- (1999) 1st International Conference on Computers and Games , pp. 126-145
- Buro, M.¹

6
- 24944563092
- Evaluation in Go by a neural network using soft segmentation
- Enzenberger, M. (2003). Evaluation in Go by a neural network using soft segmentation. 10th Advances in Computer Games Conference (pp. 97-108).
- (2003) 10th Advances in Computer Games Conference , pp. 97-108
- Enzenberger, M.¹

8
- 33750293964
- Bandit based Monte-Carlo planning
- Kocsis, L., & Szepesvari, C. (2006). Bandit based Monte-Carlo planning. 15th European Conference on Machine Learning (pp. 282-293).
- (2006) 15th European Conference on Machine Learning , pp. 282-293
- Kocsis, L.¹ Szepesvari, C.²

9
- 0038145011
- Temporal difference learning applied to a high-performance game-playing program
- Schaeffer, J., Hlynka, M., & Jussila, V. (2001). Temporal difference learning applied to a high-performance game-playing program. 17th International Joint Conference on Artificial Intelligence (pp. 529-534).
- (2001) 17th International Joint Conference on Artificial Intelligence , pp. 529-534
- Schaeffer, J.¹ Hlynka, M.² Jussila, V.³

11
- 84880900542
- Reinforcement learning of local shape in the game of Go
- Silver, D., Sutton, R., & Müller, M. (2007). Reinforcement learning of local shape in the game of Go. 20th International Joint Conference on Artificial Intelligence (pp. 1053-1058).
- (2007) 20th International Joint Conference on Artificial Intelligence , pp. 1053-1058
- Silver, D.¹ Sutton, R.² Müller, M.³

12
- 33847202724
- Learning to predict by the method of temporal differences
- Sutton, R. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.¹

13
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Sutton, R. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. 7th International Conference on Machine Learning (pp. 216-224).
- (1990) 7th International Conference on Machine Learning , pp. 216-224
- Sutton, R.¹

14
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- Sutton, R. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. Advances in Neural Information Processing Systems 8 (pp. 1038-1044).
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
- Sutton, R.¹

15
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.¹ Barto, A.²

16
- 34547981323
- Modifications of UCT and sequence-like simulations for Monte-Carlo Go
- Wang, Y., & Gelly, S. (2007). Modifications of UCT and sequence-like simulations for Monte-Carlo Go. IEEE Symposium on Computational Intelligence and Games, Honolulu, Hawaii (pp. 175-182).
- (2007) IEEE Symposium on Computational Intelligence and Games, Honolulu, Hawaii , pp. 175-182
- Wang, Y.¹ Gelly, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.