SCOPUS Information Search Platform

IEEE Transactions on Evolutionary Computation

Volume 9, Issue 6, 2005, Pages 628-640

Coevolution versus self-play temporal difference learning for acquiring position evaluation in small-board go

(2) Runarsson, Thomas Philip (a); Lucas, Simon M. (b)

a UNIVERSITY OF ICELAND (Iceland)

b UNIVERSITY OF ESSEX (United Kingdom)

Author keywords

Coevolution; Evolution strategy; Game Strategies; Reinforcement learning; Temporal difference learning

Indexed keywords

EVOLUTIONARY ALGORITHMS; GAME THEORY; GRADIENT METHODS; LEARNING ALGORITHMS; OPTIMIZATION;

COEVOLUTION; EVOLUTION STRATEGY; GAME STRATEGIES; REINFORCEMENT LEARNING; TEMPORAL DIFFERENCE LEARNING;

LEARNING SYSTEMS;

EID: 29244474089 PISSN: 1089778X EISSN: None Source Type: Journal
DOI: 10.1109/TEVC.2005.856212 Document Type: Article

Times cited : (59)

References (46)

1
- 0000218399
- "Programming a computer for playing chess"
- C. Shannon, "Programming a computer for playing chess," Philos. Mag., vol. 41, pp. 256-275, 1950.
- (1950) Philos. Mag. , vol.41 , pp. 256-275
- Shannon, C.¹

2
- 0001201756
- "Some studies in machine learning using the game of checkers"
- A. Samuel, "Some studies in machine learning using the game of checkers," IBM J. Res. Develop., vol. 3, pp. 211-229, 1959.
- (1959) IBM J. Res. Develop. , vol.3 , pp. 211-229
- Samuel, A.¹

3
- 0013395102
- "Trial and error"
- Baltimore,MD: Penguin
- D. Michie, "Trial and error," in Science Survey, Part 2. Baltimore,MD: Penguin, 1961, pp. 129-145.
- (1961) Science Survey, Part 2 , pp. 129-145
- Michie, D.¹

4
- 0032156067
- "Co-evolution in the successful learning of backgammon strategy"
- J. Pollack and A. Blair, "Co-evolution in the successful learning of backgammon strategy," Mach. Learn., vol. 32, pp. 225-240, 1998.
- (1998) Mach. Learn. , vol.32 , pp. 225-240
- Pollack, J.¹ Blair, A.²

5
- 0033325801
- "Evolving neural networks to play checkers without expert knowledge"
- K. Chellapilla and D. Fogel, "Evolving neural networks to play checkers without expert knowledge," IEEE Trans. Neural Netw., vol. 10, no. 6, pp. 1382-1391, 1999.
- (1999) IEEE Trans. Neural Netw. , vol.10 , Issue.6 , pp. 1382-1391
- Chellapilla, K.¹ Fogel, D.²

6
- 0035415173
- "Evolving an expert checkers playing program without using human expertise"
- Oct.
- K. Chellapilla and D. Fogel, "Evolving an expert checkers playing program without using human expertise," IEEE Trans. Evol. Comput., vol. 5, no. 5, pp. 422-428, Oct. 2001.
- (2001) IEEE Trans. Evol. Comput. , vol.5 , Issue.5 , pp. 422-428
- Chellapilla, K.¹ Fogel, D.²

7
- 0345407827
- San Mateo, CA: Morgan Kaufmann
- D. Fogel, Blondie24: Playing at the Edge of AI. San Mateo, CA: Morgan Kaufmann, 2002.
- (2002) Blondie24: Playing at the Edge of AI
- Fogel, D.¹

8
- 1942420841
- "The significance of temporal-difference learning in self-play training: TD-rummy versus EVO-rummy"
- C. Kotnik and J. Kalita, "The significance of temporal-difference learning in self-play training: TD-rummy versus EVO-rummy," in Proc. Int. Conf. Mach. Learn. (ICML), 2003, pp. 369-375.
- (2003) Proc. Int. Conf. Mach. Learn. (ICML) , pp. 369-375
- Kotnik, C.¹ Kalita, J.²

9
- 0034876587
- "Why co-evolution beats temporal difference learning at backgammon for a linear architecture, but not a nonlinear architecture"
- [Online]. Available: citeseer.ist.psu.edu/article/darwenO1why.html
- P. J. Darwen, "Why co-evolution beats temporal difference learning at backgammon for a linear architecture, but not a nonlinear architecture," in Proc. Congr. Evol. Comput., 2001, pp. 1003-1010. [Online]. Available: citeseer.ist.psu.edu/article/darwenO1why.html.
- (2001) Proc. Congr. Evol. Comput. , pp. 1003-1010
- Darwen, P.J.¹

10
- 0032156140
- "Comments on 'co-evolution in the successful learning of backgammon strategy'"
- G. Tesauro, "Comments on 'co-evolution in the successful learning of backgammon strategy'," Mach. Learn., vol. 32, no. 3, pp. 241-243, 1998.
- (1998) Mach. Learn. , vol.32 , Issue.3 , pp. 241-243
- Tesauro, G.¹

11
- 0029276036
- "Temporal difference learning and TD-gammon"
- G. Tesauro, "Temporal difference learning and TD-gammon," Commun. ACM, vol. 38, no. 3, pp. 58-68, 1995.
- (1995) Commun. ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

12
- 35048832783
- "Evolving a roving eye for go"
- K. Stanley and R. Miikkulainen, "Evolving a roving eye for go," in Proc. Genetic Evol. Comput. Conf., 2004, pp. 1226-1238.
- (2004) Proc. Genetic Evol. Comput. Conf. , pp. 1226-1238
- Stanley, K.¹ Miikkulainen, R.²

13
- 0031682491
- "Evolving neural networks to play go"
- N. Richards, D. Moriarty, and R. Miikkulainen, "Evolving neural networks to play go," Appl. Intell., vol. 8, pp. 85-96, 1998.
- (1998) Appl. Intell. , vol.8 , pp. 85-96
- Richards, N.¹ Moriarty, D.² Miikkulainen, R.³

14
- 21244455604
- "An evolutionary self-learning chess program"
- Dec.
- D. Fogel, T. Hays, S. Hahn, and J. Quon, "An evolutionary self-learning chess program," Proc. IEEE, pp. 1947-1954, Dec. 2004.
- (2004) Proc. IEEE , pp. 1947-1954
- Fogel, D.¹ Hays, T.² Hahn, S.³ Quon, J.⁴

15
- 29244440658
- "Evolving speciated checkers players with crowding algorithm"
- K.-J. Kim and S.-B. Cho, "Evolving speciated checkers players with crowding algorithm," in Proc. Congr. Evolut. Comput., vol. 1, 2002, pp. 407-412.
- (2002) Proc. Congr. Evolut. Comput. , vol.1 , pp. 407-412
- Kim, K.-J.¹ Cho, S.-B.²

16
- 0004156494
- "Reinforcement learning through evolutionary computation"
- D. Moriarty, A. Schultz, and J. Grefenstette, "Reinforcement learning through evolutionary computation," J. Artif. Intell. Res., vol. 11, pp. 241-276, 1999.
- (1999) J. Artif. Intell. Res. , vol.11 , pp. 241-276
- Moriarty, D.¹ Schultz, A.² Grefenstette, J.³

17
- 84901478156
- "Comparing PSO structures to learn the game of checkers from zero knowledge"
- N. Franken and A. Engelbrecht, "Comparing PSO structures to learn the game of checkers from zero knowledge," in Proc. Congr. Evol. Comput., 2003, pp. 234-241.
- (2003) Proc. Congr. Evol. Comput. , pp. 234-241
- Franken, N.¹ Engelbrecht, A.²

18
- 3142749146
- "Learning to play games using a PSO-based competitive learning approach"
- Jun.
- L. Messerschmidt and A. Engelbrecht, "Learning to play games using a PSO-based competitive learning approach," IEEE Trans. Evol. Comput., vol. 8, no. 3, pp. 280-288, Jun. 2004.
- (2004) IEEE Trans. Evol. Comput. , vol.8 , Issue.3 , pp. 280-288
- Messerschmidt, L.¹ Engelbrecht, A.²

19
- 0003420416
- Cambridge, MA: MIT Press
- R. Sutton and A. Barto, Introduction to Reinforcement Learning. Cambridge, MA: MIT Press, 1998.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.¹ Barto, A.²

20
- 84943227555
- "Using evolutionary programming to create networks that are capable of playing tic-tac-toe"
- D. Fogel, "Using evolutionary programming to create networks that are capable of playing tic-tac-toe," in Proc. IEEE Int. Conf. Neural Netw., 1993, pp. 875-880.
- (1993) Proc. IEEE Int. Conf. Neural Netw. , pp. 875-880
- Fogel, D.¹

21
- 0001747145
- "Competitive environments evolve better solutions for complex tasks"
- S. Forrest, Ed
- P. J. Angeline and J. B. Pollack, "Competitive environments evolve better solutions for complex tasks," in Proc. 5th Int. Conf. Genetic Algorithms, S. Forrest, Ed., 1993, pp. 264-270.
- (1993) Proc. 5th Int. Conf. Genetic Algorithms , pp. 264-270
- Angeline, P.J.¹ Pollack, J.B.²

22
- 84958954854
- "Learning on graphs in the game of Go"
- T. Graepel, M. Goutrie, M. Krüger, and R. Herbrich, "Learning on graphs in the game of Go," in Proc. Int. Conf. Artif. Neural Netw., 2001, pp. 347-352.
- (2001) Proc. Int. Conf. Artif. Neural Netw. , pp. 347-352
- Graepel, T.¹ Goutrie, M.² Krüger, M.³ Herbrich, R.⁴

23
- 0000433333
- "Temporal difference learning of position evaluation in the game of Go"
- N. Schraudolph, P. Dayan, and T. Sejnowski, "Temporal difference learning of position evaluation in the game of Go," in Proc. Neural Inf. Process. Syst. 6, 1994, pp. 817-824.
- (1994) Proc. Neural Inf. Process. Syst. , vol.6 , pp. 817-824
- Schraudolph, N.¹ Dayan, P.² Sejnowski, T.³

24
- 84902482960
- "Evaluation in Go by a neural network using soft segmentation"
- M. Enzenberger, "Evaluation in Go by a neural network using soft segmentation," in Proc. 10th Advances Comput. Games Conf., 2003, pp. 97-108.
- (2003) Proc. 10th Advances Comput. Games Conf. , pp. 97-108
- Enzenberger, M.¹

25
- 4344629171
- "An investigation of an evolutionary approach of the opening of Go"
- G. Kendall, R. Yaakob, and P. Hingston, "An investigation of an evolutionary approach of the opening of Go," in Proc. Congr. Evol. Comput., 2004, pp. 2052-2059.
- (2004) Proc. Congr. Evol. Comput. , pp. 2052-2059
- Kendall, G.¹ Yaakob, R.² Hingston, P.³

26
- 4243659908
- Knowledge representation in the many faces of Go
- Tech. Rep., American Go Association. [Online]. Available: ftp:// bsd-server.ucsfedu/Go/comp/mfg.Z
- D. Fotland. (1993) Knowledge representation in the many faces of Go. Tech. Rep., American Go Association. [Online]. Available: ftp:// bsd-server.ucsfedu/Go/comp/mfg.Z
- (1993)
- Fotland, D.¹

27
- 0036149616
- "Computer go"
- M. Muller, "Computer go," Artif. Intell., vol. 134, pp. 145-179, 2002.
- (2002) Artif. Intell. , vol.134 , pp. 145-179
- Muller, M.¹

28
- 0003582853
- The integration of a priori knowledge into a Go playing neural network
- [Online]. Available
- M. Enzenberger. (1996) The integration of a priori knowledge into a Go playing neural network. [Online]. Available: http://www. markus-enzenberger.de/neurogol996.html
- (1996)
- Enzenberger, M.¹

29
- 4344712038
- "Solution Concepts in Coevolutionary Algorithms"
- Ph.D. dissertation, Brandeis Univ., Waltham, MA
- S. Ficici, "Solution Concepts in Coevolutionary Algorithms," Ph.D. dissertation, Brandeis Univ., Waltham, MA, 2004.
- (2004)
- Ficici, S.¹

30
- 0001118093
- "Co-evolving parasites improves simulated evolution as an optimization procedure"
- D. Hillis, "Co-evolving parasites improves simulated evolution as an optimization procedure," Physica D, vol. 42, pp. 228-234, 1990.
- (1990) Physica D , vol.42 , pp. 228-234
- Hillis, D.¹

31
- 2542521849
- "Co-evolving a Go-playing neural network"
- A. Lubberts and R. Miikkulainen, "Co-evolving a Go-playing neural network," in Proc. Genetic Evol. Comput. Conf. Workshop Program, 2001, pp. 14-19.
- (2001) Proc. Genetic Evol. Comput. Conf. Workshop Program , pp. 14-19
- Lubberts, A.¹ Miikkulainen, R.²

32
- 0023331258
- "An introduction to computing with neural nets"
- Apr.
- R. Lippmann, "An introduction to computing with neural nets," IEEE ASSP Mag., pp. 4-22, Apr. 1987.
- (1987) IEEE ASSP Mag. , pp. 4-22
- Lippmann, R.¹

33
- 84901411850
- "Piece difference: Simple to evolve?"
- E. Hughes, "Piece difference: Simple to evolve?," in Proc. Congr. Evol. Comput., 2003, pp. 2470-2473.
- (2003) Proc. Congr. Evol. Comput. , pp. 2470-2473
- Hughes, E.¹

34
- 4344679259
- "Competitive coevolution through evolutionary complexification"
- K. Stanley and R. Miikkulainen, "Competitive coevolution through evolutionary complexification," J. Artificial Intell. Res., vol. 21, pp. 63-100, 2004.
- (2004) J. Artificial Intell. Res. , vol.21 , pp. 63-100
- Stanley, K.¹ Miikkulainen, R.²

35
- 0002933170
- "Designing neural networks using genetic algorithm with graph generation system"
- H. Kitano, "Designing neural networks using genetic algorithm with graph generation system," Complex Syst., vol. 4, pp. 461-476, 1990.
- (1990) Complex Syst. , vol.4 , pp. 461-476
- Kitano, H.¹

36
- 0011246591
- "Cellular encoding of genetic neural networks"
- Ecole Normale Superieure de Lyon, Laboratoire de l'Informatique du Parallelisme, Tech. Rep
- F. Gruau, "Cellular encoding of genetic neural networks," Ecole Normale Superieure de Lyon, Laboratoire de l'Informatique du Parallelisme, Tech. Rep. 92-21, 1992.
- (1992) Tech. Rep. 92-21
- Gruau, F.¹

37
- 0012329219
- "A comparison between cellular encoding and direct encoding for genetic neural networks"
- Neuro-colt Tech. Rep. NC-TR-96-048
- F. Gruau, D. Whitley, and L. Pyeatt, "A comparison between cellular encoding and direct encoding for genetic neural networks," Neuro-colt Tech. Rep. NC-TR-96-048, 1996.
- (1996)
- Gruau, F.¹ Whitley, D.² Pyeatt, L.³

38
- 0038145011
- "Temporal difference learning applied to a high-performance game-playing program"
- J. Schaeffer, M. Hlynka, and V. Jussila, "Temporal difference learning applied to a high-performance game-playing program," in Proc. Int. Joint Conf. Artif. Intell., 2001, pp. 529-534.
- (2001) Proc. Int. Joint Conf. Artif. Intell. , pp. 529-534
- Schaeffer, J.¹ Hlynka, M.² Jussila, V.³

39
- 9444260194
- "Solving go on small boards"
- E. Van Der Werf, J. Van Den Herik, and J. Uiterwijk, "Solving go on small boards," ICGA J., vol. 26, no. 2, pp. 92-107, 2003.
- (2003) ICGA J. , vol.26 , Issue.2 , pp. 92-107
- Van Der Werf, E.¹ Van Den Herik, J.² Uiterwijk, J.³

40
- 27844439040
- "AI techniques for the game of Go"
- Ph.D. dissertation, Maastricht Univ., Maastricht, The Netherlands
- E. Van Der Werf, "AI techniques for the game of Go," Ph.D. dissertation, Maastricht Univ., Maastricht, The Netherlands, 2005.
- (2005)
- Van Der Werf, E.¹

41
- 0035479281
- "Computer Go: An AI-oriented survey"
- B. Bouzy and T. Cazenave, "Computer Go: An AI-oriented survey," Artif. Intell. J., pp. 39-103, 2001.
- (2001) Artif. Intell. J. , pp. 39-103
- Bouzy, B.¹ Cazenave, T.²

42
- 0002882372
- "Knightcap: A chess program that learns by combining TD(λ) with game-tree search"
- [Online]. Available: citeseer.ist.psu.edu/baxter98knightcap.html
- J. Baxter, A. Tridgell, and L. Weaver, "Knightcap: A chess program that learns by combining TD(λ) with game-tree search," in Proc. 15th Int. Conf. Mach. Learn., 1998, pp. 28-36. [Online]. Available: citeseer.ist.psu.edu/baxter98knightcap.html.
- (1998) Proc. 15th Int. Conf. Mach. Learn. , pp. 28-36
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

43
- 0003827650
- Stuttgart, Germany: Frommann-Holzboog
- I. Rechenberg, Evolutionsstrategie '94. Stuttgart, Germany: Frommann-Holzboog, 1994.
- (1994) Evolutionsstrategie '94
- Rechenberg, I.¹

44
- 84944327472
- "Reducing random fluctuations in mutative self-adaptation"
- Berlin, Germany: Springer-Verlag Parallel Problem Solving from Nature VII (PPSN-2002)
- T. P. Runarsson, "Reducing random fluctuations in mutative self-adaptation," in Lecture Notes in Computer Science. Berlin, Germany: Springer-Verlag, 2002, vol. 2439, Parallel Problem Solving from Nature VII (PPSN-2002), pp. 194-203.
- (2002) Lecture Notes in Computer Science , vol.2439 , pp. 194-203
- Runarsson, T.P.¹

45
- 0033362601
- "Evolving artificial neural networks"
- X. Yao, "Evolving artificial neural networks," Proc. IEEE, vol. 87, no. 9, pp. 1423-1447, 1999.
- (1999) Proc. IEEE , vol.87 , Issue.9 , pp. 1423-1447
- Yao, X.¹

46
- 29244435277
- "Forward-backward building blocks for evolving neural networks with intrinsic learning behaviors"
- Berlin, Germany: Springer-Verlag
- S. Lucas, "Forward-backward building blocks for evolving neural networks with intrinsic learning behaviors," in Lecture Notes in Computer Science (1240): Biological and Artificial Computation: From Neuro-science to Technology. Berlin, Germany: Springer-Verlag, 1997, pp. 723-732.
- (1997) Lecture Notes in Computer Science (1240) Biological and Artificial Computation: From Neuro-science to Technology , pp. 723-732
- Lucas, S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.