SCOPUS 정보 검색 플랫폼

Adaptation, Learning, and Optimization

Volumn 12, Issue , 2012, Pages 539-577

Reinforcement learning in games

(1) Szita, István a

a UNIVERSITY OF ALBERTA (Canada)

Author keywords

Coherence; Covariance; Defend; Income; Nash

Indexed keywords

EID: 84867399396 PISSN: 18674534 EISSN: 18674542 Source Type: Book Series
DOI: 10.1007/978-3-642-27645-3_17 Document Type: Chapter

Times cited : (52)

References (111)

1
- 26944456343
- Learning to win: Case-based plan selection in a real-time strategy game
- Aha, D.W., Molineaux, M., Ponsen, M.: Learning to win: Case-based plan selection in a real-time strategy game. Case-Based Reasoning Research and Development, 5–20 (2005)
- (2005) Case-Based Reasoning Research and Development , pp. 5-20
- Aha, D.W.¹ Molineaux, M.² Ponsen, M.³

2
- 33744825172
- Learning to bid in bridge
- Amit, A., Markovitch, S.: Learning to bid in bridge. Machine Learning 63(3), 287–327 (2006)
- (2006) Machine Learning , vol.63 , Issue.3 , pp. 287-327
- Amit, A.¹ Markovitch, S.²

3
- 84883207406
- Online adaptation of computer games agents: A reinforcement learning approach
- Andrade, G., Santana, H., Furtado, A., Leitão, A., Ramalho, G.: Online adaptation of computer games agents: A reinforcement learning approach. Scientia 15(2) (2004)
- (2004) Scientia , vol.15 , Issue.2
- Andrade, G.¹ Santana, H.² Furtado, A.³ Leitão, A.⁴ Ramalho, G.⁵

4
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47, 235–256 (2002)
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

5
- 76349090438
- Models of active learning in group-structured state spaces
- Bartók, G., Szepesvári, C., Zilles, S.: Models of active learning in group-structured state spaces. Information and Computation 208, 364–384 (2010)
- (2010) Information and Computation , vol.208 , pp. 364-384
- Bartók, G.¹ Szepesvári, C.² Zilles, S.³

6
- 0034275416
- Learning to play chess using temporal-differences
- Baxter, J., Tridgell, A., Weaver, L.: Learning to play chess using temporal-differences. Machine learning 40(3), 243–263 (2000)
- (2000) Machine Learning , vol.40 , Issue.3 , pp. 243-263
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

7
- 68049122815
- Reinforcement learning and chess
- Nova Science Publishers, Inc
- Baxter, J., Tridgell, A., Weaver, L.: Reinforcement learning and chess. In: Machines that learn to play games, pp. 91–116. Nova Science Publishers, Inc. (2001)
- (2001) Machines that Learn to Play Games , pp. 91-116
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

8
- 0004502426
- Learning piece values using temporal differences
- Beal, D., Smith, M.C.: Learning piece values using temporal differences. ICCA Journal 20(3), 147–151 (1997)
- (1997) ICCA Journal , vol.20 , Issue.3 , pp. 147-151
- Beal, D.¹ Smith, M.C.²

9
- 0003351108
- Neuro-Dynamic Programming
- Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific (1996)
- (1996) Athena Scientific
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

10
- 33745168595
- Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games
- van den Herik, H.J., Björnsson, Y., Netanyahu, N.S. (eds.), Springer, Heidelberg
- Billings, D., Davidson, A., Schauenberg, T., Burch, N., Bowling, M., Holte, R.C., Schaeffer, J., Szafron, D.: Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games. In: van den Herik, H.J., Björnsson, Y., Netanyahu, N.S. (eds.) CG 2004. LNCS, vol. 3846, pp. 21–34. Springer, Heidelberg (2006)
- (2006) CG 2004. LNCS , vol.3846 , pp. 21-34
- Billings, D.¹ Davidson, A.² Schauenberg, T.³ Burch, N.⁴ Bowling, M.⁵ Holte, R.C.⁶ Schaeffer, J.⁷ Szafron, D.⁸

11
- 71549126935
- Cadiaplayer: A simulation-based general game player
- Björnsson, Y., Finnsson, H.: Cadiaplayer: A simulation-based general game player. IEEE Transactions on Computational Intelligence and AI in Games 1(1), 4–15 (2009)
- (2009) IEEE Transactions on Computational Intelligence and AI in Games , vol.1 , Issue.1 , pp. 4-15
- Björnsson, Y.¹ Finnsson, H.²

12
- 33845339117
- Evolving a heuristic function for the game of tetris
- Böhm, N., Kókai, G., Mandl, S.: Evolving a heuristic function for the game of tetris. In: Proc. Lernen, Wissensentdeckung und Adaptivität LWA, pp. 118–122 (2004)
- (2004) Proc. Lernen, Wissensentdeckung Und Adaptivität LWA , pp. 118-122
- Böhm, N.¹ Kókai, G.² Mandl, S.³

13
- 71549125902
- On the evolution of artificial Tetris players
- Boumaza, A.: On the evolution of artificial Tetris players. In: IEEE Symposium on Computational Intelligence and Games (2009)
- (2009) IEEE Symposium on Computational Intelligence and Games
- Boumaza, A.¹

14
- 84902513084
- Monte Carlo Go developments
- Bouzy, B., Helmstetter, B.: Monte Carlo Go developments. In: Advances in Computer Games, pp. 159–174 (2003)
- (2003) Advances in Computer Games , pp. 159-174
- Bouzy, B.¹ Helmstetter, B.²

15
- 31844436490
- Convergence and no-regret in multiagent learning
- Bowling, M.: Convergence and no-regret in multiagent learning. In: Neural Information Processing Systems, pp. 209–216 (2004)
- (2004) Neural Information Processing Systems , pp. 209-216
- Bowling, M.¹

16
- 84956863737
- From simple features to sophisticated evaluation functions
- Buro, M.: From simple features to sophisticated evaluation functions. In: International Conference on Computers and Games, pp. 126–145 (1998)
- (1998) International Conference on Computers and Games , pp. 126-145
- Buro, M.¹

17
- 33744829091
- RTS games as test-bed for real-time research
- Buro, M., Furtak, T.: RTS games as test-bed for real-time research. JCIS, 481–484 (2003)
- (2003) JCIS , pp. 481-484
- Buro, M.¹ Furtak, T.²

18
- 84898602823
- The second annual real-time strategy game AI competition
- Buro, M., Lanctot, M., Orsten, S.: The second annual real-time strategy game AI competition. In: GAME-ON NA (2007)
- (2007) GAME-ON NA
- Buro, M.¹ Lanctot, M.² Orsten, S.³

19
- 55249127519
- Progressive strategies for monte-carlo tree search
- Chaslot, G., Winands, M., Herik, H., Uiterwijk, J., Bouzy, B.: Progressive strategies for monte-carlo tree search. New Mathematics and Natural Computation 4(3), 343 (2008)
- (2008) New Mathematics and Natural Computation , vol.4 , Issue.3 , pp. 343
- Chaslot, G.¹ Winands, M.² Herik, H.³ Uiterwijk, J.⁴ Bouzy, B.⁵

20
- 77953762833
- Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search
- van den Herik, H.J., Spronck, P. (eds.), Springer, Heidelberg
- Chaslot, G., Fiter, C., Hoock, J.B., Rimmel, A., Teytaud, O.: Adding Expert Knowledge and Exploration in Monte-Carlo Tree Search. In: van den Herik, H.J., Spronck, P. (eds.) ACG 2009. LNCS, vol. 6048, pp. 1–13. Springer, Heidelberg (2010)
- (2010) ACG 2009. LNCS , vol.6048 , pp. 1-13
- Chaslot, G.¹ Fiter, C.² Hoock, J.B.³ Rimmel, A.⁴ Teytaud, O.⁵

21
- 77955690350
- Including expert knowledge in bandit-based Monte-Carlo planning, with application to computer-Go
- Chatriot, L., Gelly, S., Jean-Baptiste, H., Perez, J., Rimmel, A., Teytaud, O.: Including expert knowledge in bandit-based Monte-Carlo planning, with application to computer-Go. In: European Workshop on Reinforcement Learning (2008)
- (2008) European Workshop on Reinforcement Learning
- Chatriot, L.¹ Gelly, S.² Jean-Baptiste, H.³ Perez, J.⁴ Rimmel, A.⁵ Teytaud, O.⁶

22
- 70349275222
- Bandit algorithms for tree search
- Coquelin, P.A., Munos, R.: Bandit algorithms for tree search. In: Uncertainty in Artificial Intelligence (2007)
- (2007) Uncertainty in Artificial Intelligence
- Coquelin, P.A.¹ Munos, R.²

23
- 38049037928
- Efficient Selectivity and Backup Operators in Monte-carlo Tree Search
- van den Herik, H.J., Ciancarini, P., Donkers, H.H.L.M(J.) (eds.) CG 2006, Springer, Heidelberg
- Coulom, R.: Efficient Selectivity and Backup Operators in Monte-carlo Tree Search. In: van den Herik, H.J., Ciancarini, P., Donkers, H.H.L.M(J.) (eds.) CG 2006. LNCS, vol. 4630, pp. 72–83. Springer, Heidelberg (2007)
- (2007) LNCS , vol.4630 , pp. 72-83
- Coulom, R.¹

24
- 38849139064
- Computing Elo ratings of move patterns in the game of go
- Coulom, R.: Computing Elo ratings of move patterns in the game of go. ICGA Journal 30(4), 198–208 (2007)
- (2007) ICGA Journal , vol.30 , Issue.4 , pp. 198-208
- Coulom, R.¹

25
- 24944480025
- Honte, a Go-playing program using neural nets
- Nova Science Publishers
- Dahl, F.A.: Honte, a Go-playing program using neural nets. In: Machines that learn to play games, pp. 205–223. Nova Science Publishers (2001)
- (2001) Machines that Learn to Play Games , pp. 205-223
- Dahl, F.A.¹

26
- 33751287499
- Master’s thesis, University of Alberta
- Davidson, A.: Opponent modeling in poker: Learning and acting in a hostile and uncertain environment. Master’s thesis, University of Alberta (2002)
- (2002) Opponent Modeling in Poker: Learning and Acting in a Hostile and Uncertain Environment
- Davidson, A.¹

27
- 56449093331
- An object-oriented representation for efficient reinforcement learning
- Diuk, C., Cohen, A., Littman, M.L.: An object-oriented representation for efficient reinforcement learning. In: International Conference on Machine Learning, pp. 240–247 (2008)
- (2008) International Conference on Machine Learning , pp. 240-247
- Diuk, C.¹ Cohen, A.² Littman, M.L.³

28
- 85042904303
- Tech. Rep. TUD–KE– 2008-07, Knowledge Engineering Group, TU Darmstadt
- Droste, S., Fürnkranz, J.: Learning of piece values for chess variants. Tech. Rep. TUD–KE– 2008-07, Knowledge Engineering Group, TU Darmstadt (2008)
- (2008) Learning of Piece Values for Chess Variants
- Droste, S.¹ Fürnkranz, J.²

29
- 0035312760
- Relational reinforcement learning
- Džeroski, S., Raedt, L.D., Driessens, K.: Relational reinforcement learning. Machine Learning 43(1-2), 7–52 (2001)
- (2001) Machine Learning , vol.43 , Issue.1-2 , pp. 7-52
- Džeroski, S.¹ Raedt, L.D.² Driessens, K.³

30
- 0028443409
- Toward an ideal trainer
- Epstein, S.L.: Toward an ideal trainer. Machine Learning 15, 251–277 (1994)
- (1994) Machine Learning , vol.15 , pp. 251-277
- Epstein, S.L.¹

31
- 33748427607
- Tetris: A Study of Randomized Constraint Sampling
- Springer, UK
- Farias, V.F., van Roy, B.: Tetris: A Study of Randomized Constraint Sampling. In: Probabilistic and Randomized Methods for Design Under Uncertainty. Springer, UK (2006)
- (2006) Probabilistic and Randomized Methods for Design under Uncertainty
- Farias, V.F.¹ Van Roy, B.²

32
- 65849284789
- Automatic feature generation for problem solving systems
- Fawcett, T., Utgoff, P.: Automatic feature generation for problem solving systems. In: International Conference on Machine Learning, pp. 144–153 (1992)
- (1992) International Conference on Machine Learning , pp. 144-153
- Fawcett, T.¹ Utgoff, P.²

33
- 0013372262
- Learning to play chess selectively by acquiring move patterns
- Finkelstein, L., Markovitch, S.: Learning to play chess selectively by acquiring move patterns. ICCA Journal 21, 100–119 (1998)
- (1998) ICCA Journal , vol.21 , pp. 100-119
- Finkelstein, L.¹ Markovitch, S.²

34
- 0004247096
- MIT Press
- Fudenberg, D., Levine, D.K.: The theory of learning in games. MIT Press (1998)
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

35
- 24544450341
- Machine learning in games: A survey
- Nova Science Publishers
- Fürnkranz, J.: Machine learning in games: a survey. In: Machines that Learn to Play Games, pp. 11–59. Nova Science Publishers (2001)
- (2001) Machines that Learn to Play Games , pp. 11-59
- Fürnkranz, J.¹

36
- 78149321929
- Tech. rep., TU Darmstadt
- Fürnkranz, J.: Recent advances in machine learning and game playing. Tech. rep., TU Darmstadt (2007)
- (2007) Recent Advances in Machine Learning and Game Playing
- Fürnkranz, J.¹

37
- 68949141647
- Machine learning in digital games: A survey
- Galway, L., Charles, D., Black, M.: Machine learning in digital games: a survey. Artificial Intelligence Review 29(2), 123–161 (2008)
- (2008) Artificial Intelligence Review , vol.29 , Issue.2 , pp. 123-161
- Galway, L.¹ Charles, D.² Black, M.³

38
- 57749091602
- Achieving master-level play in 9x9 computer go
- Gelly, S., Silver, D.: Achieving master-level play in 9x9 computer go. In: AAAI, pp. 1537– 1540 (2008)
- (2008) AAAI , pp. 1537-1540
- Gelly, S.¹ Silver, D.²

39
- 34250659969
- Tech. rep., INRIA
- Gelly, S., Wang, Y., Munos, R., Teytaud, O.: Modification of UCT with patterns in Monte-Carlo go. Tech. rep., INRIA (2006)
- (2006) Modification of UCT with Patterns in Monte-Carlo Go
- Gelly, S.¹ Wang, Y.² Munos, R.³ Teytaud, O.⁴

40
- 5844312285
- PhD thesis, University of California, San Diego, CA
- Gherrity, M.: A game-learning machine. PhD thesis, University of California, San Diego, CA (1993)
- (1993) A Game-Learning Machine
- Gherrity, M.¹

41
- 34948832502
- Tech. rep., Department of Computer Science, University of Bristol
- Ghory, I.: Reinforcement learning in board games. Tech. rep., Department of Computer Science, University of Bristol (2004)
- (2004) Reinforcement Learning in Board Games
- Ghory, I.¹

42
- 80052001262
- Fun game AI design for beginners
- Charles River Media, Inc
- Gilgenbach, M.: Fun game AI design for beginners. In: AI Game Programming Wisdom, vol. 3. Charles River Media, Inc. (2006)
- (2006) AI Game Programming Wisdom , vol.3
- Gilgenbach, M.¹

43
- 35348940239
- Lossless abstraction of imperfect information games
- Gilpin, A., Sandholm, T.: Lossless abstraction of imperfect information games. Journal of the ACM 54(5), 25 (2007)
- (2007) Journal of the ACM , vol.54 , Issue.5 , pp. 25
- Gilpin, A.¹ Sandholm, T.²

44
- 84860643163
- Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of Texas Hold’em poker
- Gilpin, A., Sandholm, T., Sørensen, T.B.: Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of Texas Hold’em poker. In: AAAI, vol. 22, pp. 50–57 (2007)
- (2007) AAAI , vol.22 , pp. 50-57
- Gilpin, A.¹ Sandholm, T.² Sørensen, T.B.³

45
- 0036374294
- Gib: Imperfect information in a computationally challenging game
- Ginsberg, M.L.: Gib: Imperfect information in a computationally challenging game. Journal of Artificial Intelligence Research 14, 313–368 (2002)
- (2002) Journal of Artificial Intelligence Research , vol.14 , pp. 313-368
- Ginsberg, M.L.¹

46
- 85042906319
- Tech. Rep. UCSC-CRL-92-10, University of California at Santa Cruz
- Gould, J., Levinson, R.: Experience-based adaptive search. Tech. Rep. UCSC-CRL-92-10, University of California at Santa Cruz (1992)
- (1992) Experience-Based Adaptive Search
- Gould, J.¹ Levinson, R.²

47
- 85042908980
- PhD thesis, Dresden University of Technology
- Günther, M.: Automatic feature construction for general game playing. PhD thesis, Dresden University of Technology (2008)
- (2008) Automatic Feature Construction for General Game Playing
- Günther, M.¹

48
- 71549132517
- Measuring player experience on runtime dynamic difficulty scaling in an RTS game
- Hagelbäck, J., Johansson, S.J.: Measuring player experience on runtime dynamic difficulty scaling in an RTS game. In: International Conference on Computational Intelligence and Games (2009)
- (2009) International Conference on Computational Intelligence and Games
- Hagelbäck, J.¹ Johansson, S.J.²

49
- 85042940628
- Online learning from observation for interactive computer games
- Hartley, T., Mehdi, Q., Gough, N.: Online learning from observation for interactive computer games. In: International Conference on Computer Games: Artificial Intelligence and Mobile Systems, pp. 27–30 (2005)
- (2005) International Conference on Computer Games: Artificial Intelligence and Mobile Systems , pp. 27-30
- Hartley, T.¹ Mehdi, Q.² Gough, N.³

50
- 0036149663
- Games solved: Now and in the future
- van den Herik, H.J., Uiterwijk, J.W.H.M., van Rijswijck, J.: Games solved: Now and in the future. Artificial Intelligence 134, 277–311 (2002)
- (2002) Artificial Intelligence , vol.134 , pp. 277-311
- Van Den Herik, H.J.¹ Uiterwijk, J.W.H.M.² Van Rijswijck, J.³

51
- 0004128366
- Princeton University Press, Princeton
- Hsu, F.H.: Behind Deep Blue: Building the Computer that Defeated the World Chess Champion. Princeton University Press, Princeton (2002)
- (2002) Behind Deep Blue: Building the Computer that Defeated the World Chess Champion
- Hsu, F.H.¹

52
- 32144439231
- AI for dynamic difficult adjustment in games
- Hunicke, R., Chapman, V.: AI for dynamic difficult adjustment in games. In: Challenges in Game AI Workshop (2004)
- (2004) Challenges in Game AI Workshop
- Hunicke, R.¹ Chapman, V.²

53
- 33646243319
- A natural policy gradient
- Kakade, S.: A natural policy gradient. In: Advances in Neural Information Processing Systems, vol. 14, pp. 1531–1538 (2001)
- (2001) Advances in Neural Information Processing Systems , vol.14 , pp. 1531-1538
- Kakade, S.¹

54
- 84986621078
- On verifying game designs and playing strategies using reinforcement learning
- Kalles, D., Kanellopoulos, P.: On verifying game designs and playing strategies using reinforcement learning. In: ACM Symposium on Applied Computing, pp. 6–11 (2001)
- (2001) ACM Symposium on Applied Computing , pp. 6-11
- Kalles, D.¹ Kanellopoulos, P.²

55
- 85042912802
- BSc thesis
- Kerbusch, P.: Learning unit values in Wargus using temporal differences. BSc thesis (2005)
- (2005) Learning Unit Values in Wargus Using Temporal Differences
- Kerbusch, P.¹

56
- 33750293964
- Bandit Based Monte-Carlo Planning
- Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.), Springer, Heidelberg
- Kocsis, L., Szepesvári, C.: Bandit Based Monte-Carlo Planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006)
- (2006) ECML 2006. LNCS (LNAI) , vol.4212 , pp. 282-293
- Kocsis, L.¹ Szepesvári, C.²

57
- 77049089986
- RSPSA: Enhanced Parameter Optimization in Games
- van den Herik, H.J., Hsu, S.-C., Hsu, T.-s., Donkers, H.H.L.M(J.) (eds.), Springer, Heidelberg
- Kocsis, L., Szepesvári, C., Winands, M.H.M.: RSPSA: Enhanced Parameter Optimization in Games. In: van den Herik, H.J., Hsu, S.-C., Hsu, T.-s., Donkers, H.H.L.M(J.) (eds.) CG 2005. LNCS, vol. 4250, pp. 39–56. Springer, Heidelberg (2006)
- (2006) CG 2005. LNCS , vol.4250 , pp. 39-56
- Kocsis, L.¹ Szepesvári, C.² Winands, M.H.M.³

58
- 77957870581
- University of Utrecht, The Netherlands
- Kok, E.: Adaptive reinforcement learning agents in RTS games. Master’s thesis, University of Utrecht, The Netherlands (2008)
- (2008) Adaptive Reinforcement Learning Agents in RTS Games. Master’s Thesis
- Kok, E.¹

59
- 0003882343
- MIT Press
- Koza, J.: Genetic programming: on the programming of computers by means of natural selection. MIT Press (1992)
- (1992) Genetic Programming: On the Programming of Computers by means of Natural Selection
- Koza, J.¹

60
- 84941120617
- PhD thesis, University of Texas at Austin
- Kuhlmann, G.J.: Automated domain analysis and transfer learning in general game playing. PhD thesis, University of Texas at Austin (2010)
- (2010) Automated Domain Analysis and Transfer Learning in General Game Playing
- Kuhlmann, G.J.¹

61
- 35048819671
- Least-Squares Methods in Reinforcement Learning for Control
- Vlahavas, I.P., Spyropoulos, C.D. (eds.), Springer, Heidelberg
- Lagoudakis, M.G., Parr, R., Littman, M.L.: Least-Squares Methods in Reinforcement Learning for Control. In: Vlahavas, I.P., Spyropoulos, C.D. (eds.) SETN 2002. LNCS (LNAI), vol. 2308, pp. 249–260. Springer, Heidelberg (2002)
- (2002) SETN 2002. LNCS (LNAI) , vol.2308 , pp. 249-260
- Lagoudakis, M.G.¹ Parr, R.² Littman, M.L.³

62
- 70349283084
- Master’s thesis, University of Aarhus
- Laursen, R., Nielsen, D.: Investigating small scale combat situations in real time strategy computer games. Master’s thesis, University of Aarhus (2005)
- (2005) Investigating Small Scale Combat Situations in Real Time Strategy Computer Games
- Laursen, R.¹ Nielsen, D.²

63
- 84898646291
- Chess Neighborhoods, Function Combination, and Reinforcement Learning
- Marsland, T., Frank, I. (eds.), Springer, Heidelberg
- Levinson, R., Weber, R.: Chess Neighborhoods, Function Combination, and Reinforcement Learning. In: Marsland, T., Frank, I. (eds.) CG 2001. LNCS, vol. 2063, pp. 133–150. Springer, Heidelberg (2002)
- (2002) CG 2001. LNCS , vol.2063 , pp. 133-150
- Levinson, R.¹ Weber, R.²

64
- 33747193691
- Beyond Optimal Play in Two-Person-Zerosum Games
- Albers, S., Radzik, T. (eds.), Springer, Heidelberg
- Lorenz, U.: Beyond Optimal Play in Two-Person-Zerosum Games. In: Albers, S., Radzik, T. (eds.) ESA 2004. LNCS, vol. 3221, pp. 749–759. Springer, Heidelberg (2004)
- (2004) ESA 2004. LNCS , vol.3221 , pp. 749-759
- Lorenz, U.¹

65
- 84856026867
- Springer, Heidelberg
- Mańdziuk, J.: Knowledge-Free and Learning-Based Methods in Intelligent Game Playing. Springer, Heidelberg (2010)
- (2010) Knowledge-Free and Learning-Based Methods in Intelligent Game Playing
- Mańdziuk, J.¹

66
- 84955506462
- Writing Stratagus-playing agents in concurrent alisp
- Marthi, B., Russell, S., Latham, D.: Writing Stratagus-playing agents in concurrent alisp. In: IJCAI Workshop on Reasoning, Representation, and Learning in Computer Games, pp. 67–71 (2005)
- (2005) IJCAI Workshop on Reasoning, Representation, and Learning in Computer Games , pp. 67-71
- Marthi, B.¹ Russell, S.² Latham, D.³

67
- 33646264632
- Learning of AI players from game observation data
- McGlinchey, S.J.: Learning of AI players from game observation data. In: GAME-ON, pp. 106–110 (2003)
- (2003) GAME-ON , pp. 106-110
- McGlinchey, S.J.¹

68
- 84885198927
- Defeating novel opponents in a real-time strategy game
- Molineaux, M., Aha, D.W., Ponsen, M.: Defeating novel opponents in a real-time strategy game. In: IJCAI Workshop on Reasoning, Representation, and Learning in Computer Games, pp. 72–77 (2005)
- (2005) IJCAI Workshop on Reasoning, Representation, and Learning in Computer Games , pp. 72-77
- Molineaux, M.¹ Aha, D.W.² Ponsen, M.³

69
- 21844502480
- Discovering complex Othello strategies through evolutionary neural networks
- Moriarty, D.E., Miikkulainen, R.: Discovering complex Othello strategies through evolutionary neural networks. Connection Science 7, 195–209 (1995)
- (1995) Connection Science , vol.7 , pp. 195-209
- Moriarty, D.E.¹ Miikkulainen, R.²

70
- 24944583230
- Position evaluation in computer go
- Müller, M.: Position evaluation in computer go. ICGA Journal 25(4), 219–228 (2002)
- (2002) ICGA Journal , vol.25 , Issue.4 , pp. 219-228
- Müller, M.¹

71
- 84864656513
- Master’s thesis, University of Alberta
- Naddaf, Y.: Game-independent AI agents for playing Atari 2600 console games. Master’s thesis, University of Alberta (2010)
- (2010) Game-Independent AI Agents for Playing Atari 2600 Console Games
- Naddaf, Y.¹

72
- 84899013439
- Why did TD-Gammon work?
- Pollack, J.B., Blair, A.D.: Why did TD-Gammon work? In: Neural Information Processing Systems, vol. 9, pp. 10–16 (1997)
- (1997) Neural Information Processing Systems , vol.9 , pp. 10-16
- Pollack, J.B.¹ Blair, A.D.²

73
- 26944432123
- Improving adaptive game AI with evolutionary learning
- Ponsen, M., Spronck, P.: Improving adaptive game AI with evolutionary learning. In: Computer Games: Artificial Intelligence, Design and Education (2004)
- (2004) Computer Games: Artificial Intelligence, Design and Education
- Ponsen, M.¹ Spronck, P.²

74
- 85042915837
- Automatically acquiring adaptive real-time strategy game opponents using evolutionary learning
- Ponsen, M., Muñoz-Avila, H., Spronck, P., Aha, D.W.: Automatically acquiring adaptive real-time strategy game opponents using evolutionary learning. In: Proceedings of the 17th Innovative Applications of Artificial Intelligence Conference (2005)
- (2005) Proceedings of the 17Th Innovative Applications of Artificial Intelligence Conference
- Ponsen, M.¹ Muñoz-Avila, H.² Spronck, P.³ Aha, D.W.⁴

75
- 84898624697
- Hierarchical reinforcement learning in computer games
- Ponsen, M., Spronck, P., Tuyls, K.: Hierarchical reinforcement learning in computer games. In: Adaptive Learning Agents and Multi-Agent Systems, pp. 49–60 (2006)
- (2006) Adaptive Learning Agents and Multi-Agent Systems , pp. 49-60
- Ponsen, M.¹ Spronck, P.² Tuyls, K.³

76
- 77950871800
- Abstraction and Generalization in Reinforcement Learning: A Summary and Framework
- Taylor, M.E., Tuyls, K. (eds.), Springer, Heidelberg
- Ponsen, M., Taylor, M.E., Tuyls, K.: Abstraction and Generalization in Reinforcement Learning: A Summary and Framework. In: Taylor, M.E., Tuyls, K. (eds.) ALA 2009. LNCS, vol. 5924, pp. 1–33. Springer, Heidelberg (2010)
- (2010) ALA 2009. LNCS , vol.5924 , pp. 1-33
- Ponsen, M.¹ Taylor, M.E.² Tuyls, K.³

77
- 78650622420
- Adversarial search spaces and sampling-based planning
- Ramanujan, R., Sabharwal, A., Selman, B.: Adversarial search spaces and sampling-based planning. In: International Conference on Automated Planning and Scheduling (2010)
- (2010) International Conference on Automated Planning and Scheduling
- Ramanujan, R.¹ Sabharwal, A.² Selman, B.³

78
- 79953170325
- Using counterfactual regret minimization to create competitive multi-player poker agents
- Risk, N., Szafron, D.: Using counterfactual regret minimization to create competitive multi-player poker agents. In: International Conference on Autonomous Agents and Multiagent Systems, pp. 159–166 (2010)
- (2010) International Conference on Autonomous Agents and Multiagent Systems , pp. 159-166
- Risk, N.¹ Szafron, D.²

79
- 79953207627
- Computer poker: A review
- Rubin, J., Watson, I.: Computer poker: A review. Artificial Intelligence 175(5-6), 958–987 (2011)
- (2011) Artificial Intelligence , vol.175 , Issue.5-6 , pp. 958-987
- Rubin, J.¹ Watson, I.²

80
- 0000302898
- The games computers (And people) play
- Zelkowitz, M. (ed.), Academic Press
- Schaeffer, J.: The games computers (and people) play. In: Zelkowitz, M. (ed.) Advances in Computers, vol. 50, pp. 89–266. Academic Press (2000)
- (2000) Advances in Computers , vol.50 , pp. 89-266
- Schaeffer, J.¹

81
- 0038145011
- Temporal difference learning applied to a high-performance game-playing program
- Schaeffer, J., Hlynka, M., Jussila, V.: Temporal difference learning applied to a high-performance game-playing program. In: International Joint Conference on Artificial Intelligence, pp. 529–534 (2001)
- (2001) International Joint Conference on Artificial Intelligence , pp. 529-534
- Schaeffer, J.¹ Hlynka, M.² Jussila, V.³

82
- 78751687085
- Probabilistic state translation in extensive games with large action sets
- Schnizlein, D., Bowling, M., Szafron, D.: Probabilistic state translation in extensive games with large action sets. In: International Joint Conference on Artificial Intelligence, pp. 278–284 (2009)
- (2009) International Joint Conference on Artificial Intelligence , pp. 278-284
- Schnizlein, D.¹ Bowling, M.² Szafron, D.³

83
- 9444229856
- Learning to evaluate go positions via temporal difference methods
- Springer, Heidelberg
- Schraudolph, N.N., Dayan, P., Sejnowski, T.J.: Learning to evaluate go positions via temporal difference methods. In: Computational Intelligence in Games. Studies in Fuzziness and Soft Computing, ch. 4, vol. 62, pp. 77–98. Springer, Heidelberg (2001)
- (2001) Computational Intelligence in Games. Studies in Fuzziness and Soft Computing, Ch. 4 , vol.62 , pp. 77-98
- Schraudolph, N.N.¹ Dayan, P.² Sejnowski, T.J.³

84
- 33744791590
- The illusion of intelligence
- Charles River Media
- Scott, B.: The illusion of intelligence. In: AI Game Programming Wisdom, pp. 16–20. Charles River Media (2002)
- (2002) AI Game Programming Wisdom , pp. 16-20
- Scott, B.¹

85
- 0345014819
- Learning a Game Strategy Using Pattern-Weights and Self-Play
- Schaeffer, J., Müller, M., Björnsson, Y. (eds.), Springer, Heidelberg
- Shapiro, A., Fuchs, G., Levinson, R.: Learning a Game Strategy Using Pattern-Weights and Self-Play. In: Schaeffer, J., Müller, M., Björnsson, Y. (eds.) CG 2002. LNCS, vol. 2883, pp. 42–60. Springer, Heidelberg (2003)
- (2003) CG 2002. LNCS , vol.2883 , pp. 42-60
- Shapiro, A.¹ Fuchs, G.² Levinson, R.³

86
- 84883067875
- Learning companion behaviors using reinforcement learning in games
- Sharifi, A.A., Zhao, R., Szafron, D.: Learning companion behaviors using reinforcement learning in games. In: AIIDE (2010)
- (2010) AIIDE
- Sharifi, A.A.¹ Zhao, R.² Szafron, D.³

87
- 73449102527
- General game playing: An overview and open problems
- Sharma, S., Kobti, Z., Goodwin, S.: General game playing: An overview and open problems. In: International Conference on Computing, Engineering and Information, pp. 257–260 (2009)
- (2009) International Conference on Computing, Engineering and Information , pp. 257-260
- Sharma, S.¹ Kobti, Z.² Goodwin, S.³

88
- 71149102015
- Monte-carlo simulation balancing
- Silver, D., Tesauro, G.: Monte-carlo simulation balancing. In: International Conference on Machine Learning (2009)
- (2009) International Conference on Machine Learning
- Silver, D.¹ Tesauro, G.²

89
- 56449110907
- Sample-based learning and search with permanent and transient memories
- Silver, D., Sutton, R., Mueller, M.: Sample-based learning and search with permanent and transient memories. In: ICML (2008)
- (2008) ICML
- Silver, D.¹ Sutton, R.² Mueller, M.³

90
- 33749542723
- Difficulty scaling of game AI
- Spronck, P., Sprinkhuizen-Kuyper, I., Postma, E.: Difficulty scaling of game AI. In: GAME-ON 2004: 5th International Conference on Intelligent Games and Simulation (2004)
- (2004) GAME-ON 2004: 5Th International Conference on Intelligent Games and Simulation
- Spronck, P.¹ Sprinkhuizen-Kuyper, I.² Postma, E.³

91
- 33744791741
- Adaptive game AI with dynamic scripting
- Spronck, P., Ponsen, M., Sprinkhuizen-Kuyper, I., Postma, E.: Adaptive game AI with dynamic scripting. Machine Learning 63(3), 217–248 (2006)
- (2006) Machine Learning , vol.63 , Issue.3 , pp. 217-248
- Spronck, P.¹ Ponsen, M.² Sprinkhuizen-Kuyper, I.³ Postma, E.⁴

92
- 29244482280
- Real-time neuroevolution in the NERO video game
- Stanley, K.O., Bryant, B.D., Miikkulainen, R.: Real-time neuroevolution in the NERO video game. IEEE Transactions on Evolutionary Computation 9(6), 653–668 (2005)
- (2005) IEEE Transactions on Evolutionary Computation , vol.9 , Issue.6 , pp. 653-668
- Stanley, K.O.¹ Bryant, B.D.² Miikkulainen, R.³

93
- 38049011913
- Feature construction for reinforcement learning in Hearts
- Sturtevant, N., White, A.: Feature construction for reinforcement learning in Hearts. In: Advances in Computers and Games, pp. 122–134 (2007)
- (2007) Advances in Computers and Games , pp. 122-134
- Sturtevant, N.¹ White, A.²

94
- 80155138000
- Case-based reasoning for improved micromanagement in real-time strategy games
- Szczepański, T., Aamodt, A.: Case-based reasoning for improved micromanagement in real-time strategy games. In: Workshop on Case-Based Reasoning for Computer Games, 8th International Conference on Case-Based Reasoning, pp. 139–148 (2009)
- (2009) Workshop on Case-Based Reasoning for Computer Games, 8Th International Conference on Case-Based Reasoning , pp. 139-148
- Szczepański, T.¹ Aamodt, A.²

95
- 33845344721
- Learning Tetris using the noisy cross-entropy method
- Szita, I., Lörincz, A.: Learning Tetris using the noisy cross-entropy method. Neural Computation 18(12), 2936–2941 (2006a)
- (2006) Neural Computation , vol.18 , Issue.12 , pp. 2936-2941
- Szita, I.¹ Lörincz, A.²

96
- 38349162555
- Learning to play using low-complexity rule-based policies: Illustrations through Ms
- Szita, I., Lörincz, A.: Learning to play using low-complexity rule-based policies: Illustrations through Ms. Pac-Man. Journal of Articial Intelligence Research 30, 659–684 (2006b)
- (2006) Pac-Man. Journal of Articial Intelligence Research , vol.30 , pp. 659-684
- Szita, I.¹ Lörincz, A.²

97
- 84897523719
- Sz-tetris as a benchmark for studying key problems of rl
- Szita, I., Szepesvári, C.: Sz-tetris as a benchmark for studying key problems of rl. In: ICML 2010 Workshop on Machine Learning and Games (2010)
- (2010) ICML 2010 Workshop on Machine Learning and Games
- Szita, I.¹ Szepesvári, C.²

98
- 77953795616
- Monte-Carlo Tree Search in Settlers of Catan
- van den Herik, H.J., Spronck, P. (eds.), Springer, Heidelberg
- Szita, I., Chaslot, G., Spronck, P.: Monte-Carlo Tree Search in Settlers of Catan. In: van den Herik, H.J., Spronck, P. (eds.) ACG 2009. LNCS, vol. 6048, pp. 21–32. Springer, Heidelberg (2010)
- (2010) ACG 2009. LNCS , vol.6048 , pp. 21-32
- Szita, I.¹ Chaslot, G.² Spronck, P.³

99
- 0001046225
- Practical issues in temporal difference learning
- Tesauro, G.: Practical issues in temporal difference learning. Machine Learning 8, 257–277 (1992)
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.¹

100
- 0029276036
- Temporal difference learning and TD-gammon
- Tesauro, G.: Temporal difference learning and TD-gammon. Communications of the ACM 38(3), 58–68 (1995)
- (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

101
- 0032156140
- Comments on co-evolution in the successful learning of backgammon strategy
- Tesauro, G.: Comments on co-evolution in the successful learning of backgammon strategy’. Machine Learning 32(3), 241–243 (1998)
- (1998) Machine Learning , vol.32 , Issue.3 , pp. 241-243
- Tesauro, G.¹

102
- 0036147771
- Programming backgammon using self-teaching neural nets
- Tesauro, G.: Programming backgammon using self-teaching neural nets. Artificial Intelligence 134(1-2), 181–199 (2002)
- (2002) Artificial Intelligence , vol.134 , Issue.1-2 , pp. 181-199
- Tesauro, G.¹

103
- 70350140182
- Building controllers for Tetris
- Thiery, C., Scherrer, B.: Building controllers for Tetris. ICGA Journal 32(1), 3–11 (2009)
- (2009) ICGA Journal , vol.32 , Issue.1 , pp. 3-11
- Thiery, C.¹ Scherrer, B.²

104
- 85153958149
- Learning to play the game of chess
- Thrun, S.: Learning to play the game of chess. In: Neural Information Processing Systems, vol. 7, pp. 1069–1076 (1995)
- (1995) Neural Information Processing Systems , vol.7 , pp. 1069-1076
- Thrun, S.¹

105
- 27844446638
- Feature construction for game playing
- Fürnkranz, J., Kubat, M. (eds.), Nova Science Publishers
- Utgoff, P.: Feature construction for game playing. In: Fürnkranz, J., Kubat, M. (eds.) Machines that Learn to Play Games, pp. 131–152. Nova Science Publishers (2001)
- (2001) Machines that Learn to Play Games , pp. 131-152
- Utgoff, P.¹

106
- 2342555200
- Constructive function approximation
- Liu, H., Motoda, H. (eds.), Kluwer Academic Publishers
- Utgoff, P., Precup, D.: Constructive function approximation. In: Liu, H., Motoda, H. (eds.) Feature Extraction, Construction and Selection: A Data Mining Perspective, vol. 453, pp. 219–235. Kluwer Academic Publishers (1998)
- (1998) Feature Extraction, Construction and Selection: A Data Mining Perspective , vol.453 , pp. 219-235
- Utgoff, P.¹ Precup, D.²

107
- 84858720579
- Bootstrapping from game tree search
- Veness, J., Silver, D., Uther, W., Blair, A.: Bootstrapping from game tree search. In: Neural Information Processing Systems, vol. 22, pp. 1937–1945 (2009)
- (2009) Neural Information Processing Systems , vol.22 , pp. 1937-1945
- Veness, J.¹ Silver, D.² Uther, W.³ Blair, A.⁴

108
- 84883083750
- Case-based reasoning for build order in real-time strategy games
- Weber, B.G., Mateas, M.: Case-based reasoning for build order in real-time strategy games. In: Artificial Intelligence and Interactive Digital Entertainment, pp. 1313–1318 (2009)
- (2009) Artificial Intelligence and Interactive Digital Entertainment , pp. 1313-1318
- Weber, B.G.¹ Mateas, M.²

109
- 70349301926
- Using reinforcement learning for city site selection in the turn-based strategy game Civilization IV
- Wender, S., Watson, I.: Using reinforcement learning for city site selection in the turn-based strategy game Civilization IV. In: Computational Intelligence and Games, pp. 372–377 (2009)
- (2009) Computational Intelligence and Games , pp. 372-377
- Wender, S.¹ Watson, I.²

110
- 82655164054
- Self-play and using an expert to learn to play backgammon with temporal difference learning
- Wiering, M.A.: Self-play and using an expert to learn to play backgammon with temporal difference learning. Journal of Intelligent Learning Systems and Applications 2, 57–68 (2010)
- (2010) Journal of Intelligent Learning Systems and Applications , vol.2 , pp. 57-68
- Wiering, M.A.¹

111
- 85162042235
- Regret minimization in games with incomplete information
- Zinkevich, M., Johanson, M., Bowling, M., Piccione, C.: Regret minimization in games with incomplete information. In: Neural Information Processing Systems, pp. 1729–1736 (2008)
- (2008) Neural Information Processing Systems , pp. 1729-1736
- Zinkevich, M.¹ Johanson, M.² Bowling, M.³ Piccione, C.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.