메뉴 건너뛰기




Volumn 9, Issue 6, 2005, Pages 628-640

Coevolution versus self-play temporal difference learning for acquiring position evaluation in small-board go

Author keywords

Coevolution; Evolution strategy; Game Strategies; Reinforcement learning; Temporal difference learning

Indexed keywords

EVOLUTIONARY ALGORITHMS; GAME THEORY; GRADIENT METHODS; LEARNING ALGORITHMS; OPTIMIZATION;

EID: 29244474089     PISSN: 1089778X     EISSN: None     Source Type: Journal    
DOI: 10.1109/TEVC.2005.856212     Document Type: Article
Times cited : (59)

References (46)
  • 1
    • 0000218399 scopus 로고
    • "Programming a computer for playing chess
    • C. Shannon, "Programming a computer for playing chess, Philos. Mag., vol. 41, pp. 256-275, 1950.
    • (1950) Philos. Mag. , vol.41 , pp. 256-275
    • Shannon, C.1
  • 2
    • 0001201756 scopus 로고
    • "Some studies in machine learning using the game of checkers"
    • A. Samuel, "Some studies in machine learning using the game of checkers," IBM J. Res. Develop., vol. 3, pp. 211-229, 1959.
    • (1959) IBM J. Res. Develop. , vol.3 , pp. 211-229
    • Samuel, A.1
  • 3
    • 0013395102 scopus 로고
    • "Trial and error"
    • Baltimore,MD: Penguin
    • D. Michie, "Trial and error," in Science Survey, Part 2. Baltimore,MD: Penguin, 1961, pp. 129-145.
    • (1961) Science Survey, Part 2 , pp. 129-145
    • Michie, D.1
  • 4
    • 0032156067 scopus 로고    scopus 로고
    • "Co-evolution in the successful learning of backgammon strategy"
    • J. Pollack and A. Blair, "Co-evolution in the successful learning of backgammon strategy," Mach. Learn., vol. 32, pp. 225-240, 1998.
    • (1998) Mach. Learn. , vol.32 , pp. 225-240
    • Pollack, J.1    Blair, A.2
  • 5
    • 0033325801 scopus 로고    scopus 로고
    • "Evolving neural networks to play checkers without expert knowledge"
    • K. Chellapilla and D. Fogel, "Evolving neural networks to play checkers without expert knowledge," IEEE Trans. Neural Netw., vol. 10, no. 6, pp. 1382-1391, 1999.
    • (1999) IEEE Trans. Neural Netw. , vol.10 , Issue.6 , pp. 1382-1391
    • Chellapilla, K.1    Fogel, D.2
  • 6
    • 0035415173 scopus 로고    scopus 로고
    • "Evolving an expert checkers playing program without using human expertise"
    • Oct.
    • K. Chellapilla and D. Fogel, "Evolving an expert checkers playing program without using human expertise," IEEE Trans. Evol. Comput., vol. 5, no. 5, pp. 422-428, Oct. 2001.
    • (2001) IEEE Trans. Evol. Comput. , vol.5 , Issue.5 , pp. 422-428
    • Chellapilla, K.1    Fogel, D.2
  • 8
    • 1942420841 scopus 로고    scopus 로고
    • "The significance of temporal-difference learning in self-play training: TD-rummy versus EVO-rummy"
    • C. Kotnik and J. Kalita, "The significance of temporal-difference learning in self-play training: TD-rummy versus EVO-rummy," in Proc. Int. Conf. Mach. Learn. (ICML), 2003, pp. 369-375.
    • (2003) Proc. Int. Conf. Mach. Learn. (ICML) , pp. 369-375
    • Kotnik, C.1    Kalita, J.2
  • 9
    • 0034876587 scopus 로고    scopus 로고
    • "Why co-evolution beats temporal difference learning at backgammon for a linear architecture, but not a nonlinear architecture
    • [Online]. Available: citeseer.ist.psu.edu/article/darwenO1why.html
    • P. J. Darwen, "Why co-evolution beats temporal difference learning at backgammon for a linear architecture, but not a nonlinear architecture;' in Proc. Congr. Evol. Comput., 2001, pp. 1003-1010. [Online]. Available: citeseer.ist.psu.edu/article/darwenO1why.html.
    • (2001) Proc. Congr. Evol. Comput. , pp. 1003-1010
    • Darwen, P.J.1
  • 10
    • 0032156140 scopus 로고    scopus 로고
    • "Comments on 'co-evolution in the successful learning of backgammon strategy'"
    • G. Tesauro, "Comments on 'co-evolution in the successful learning of backgammon strategy'" Mach. Learn., vol. 32, no. 3, pp. 241-243, 1998.
    • (1998) Mach. Learn. , vol.32 , Issue.3 , pp. 241-243
    • Tesauro, G.1
  • 11
    • 0029276036 scopus 로고
    • "Temporal difference learning and TD-gammon"
    • G. Tesauro, "Temporal difference learning and TD-gammon," Commun. ACM, vol. 38, no. 3, pp. 58-68, 1995.
    • (1995) Commun. ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 14
    • 21244455604 scopus 로고    scopus 로고
    • "An evolutionary self-learning chess program"
    • Dec.
    • D. Fogel, T. Hays, S. Hahn, and J. Quon, "An evolutionary self-learning chess program," Proc. IEEE, pp. 1947-1954, Dec. 2004.
    • (2004) Proc. IEEE , pp. 1947-1954
    • Fogel, D.1    Hays, T.2    Hahn, S.3    Quon, J.4
  • 15
    • 29244440658 scopus 로고    scopus 로고
    • "Evolving speciated checkers players with crowding algorithm"
    • K.-J. Kim and S.-B. Cho, "Evolving speciated checkers players with crowding algorithm," in Proc. Congr. Evolut. Comput., vol. 1, 2002, pp. 407-412.
    • (2002) Proc. Congr. Evolut. Comput. , vol.1 , pp. 407-412
    • Kim, K.-J.1    Cho, S.-B.2
  • 16
    • 0004156494 scopus 로고    scopus 로고
    • "Reinforcement learning through evolutionary computation"
    • D. Moriarty, A. Schultz, and J. Grefenstette, "Reinforcement learning through evolutionary computation," J. Artif. Intell. Res., vol. 11, pp. 241-276, 1999.
    • (1999) J. Artif. Intell. Res. , vol.11 , pp. 241-276
    • Moriarty, D.1    Schultz, A.2    Grefenstette, J.3
  • 17
    • 84901478156 scopus 로고    scopus 로고
    • "Comparing PSO structures to learn the game of checkers from zero knowledge"
    • N. Franken and A. Engelbrecht, "Comparing PSO structures to learn the game of checkers from zero knowledge," in Proc. Congr. Evol. Comput., 2003, pp. 234-241.
    • (2003) Proc. Congr. Evol. Comput. , pp. 234-241
    • Franken, N.1    Engelbrecht, A.2
  • 18
    • 3142749146 scopus 로고    scopus 로고
    • "Learning to play games using a PSO-based competitive learning approach"
    • Jun.
    • L. Messerschmidt and A. Engelbrecht, "Learning to play games using a PSO-based competitive learning approach," IEEE Trans. Evol. Comput., vol. 8, no. 3, pp. 280-288, Jun. 2004.
    • (2004) IEEE Trans. Evol. Comput. , vol.8 , Issue.3 , pp. 280-288
    • Messerschmidt, L.1    Engelbrecht, A.2
  • 20
    • 84943227555 scopus 로고
    • "Using evolutionary programming to create networks that are capable of playing tic-tac-toe "
    • D. Fogel, "Using evolutionary programming to create networks that are capable of playing tic-tac-toe," in Proc. IEEE Int. Conf. Neural Netw., 1993, pp. 875-880.
    • (1993) Proc. IEEE Int. Conf. Neural Netw. , pp. 875-880
    • Fogel, D.1
  • 21
    • 0001747145 scopus 로고
    • "Competitive environments evolve better solutions for complex tasks"
    • S. Forrest, Ed
    • P. J. Angeline and J. B. Pollack, "Competitive environments evolve better solutions for complex tasks," in Proc. 5th Int. Conf. Genetic Algorithms, S. Forrest, Ed., 1993, pp. 264-270.
    • (1993) Proc. 5th Int. Conf. Genetic Algorithms , pp. 264-270
    • Angeline, P.J.1    Pollack, J.B.2
  • 23
    • 0000433333 scopus 로고
    • "Temporal difference learning of position evaluation in the game of Go"
    • N. Schraudolph, P. Dayan, and T. Sejnowski, "Temporal difference learning of position evaluation in the game of Go," in Proc. Neural Inf. Process. Syst. 6, 1994, pp. 817-824.
    • (1994) Proc. Neural Inf. Process. Syst. , vol.6 , pp. 817-824
    • Schraudolph, N.1    Dayan, P.2    Sejnowski, T.3
  • 24
    • 84902482960 scopus 로고    scopus 로고
    • "Evaluation in Go by a neural network using soft segmentation"
    • M. Enzenberger, "Evaluation in Go by a neural network using soft segmentation," in Proc. 10th Advances Comput. Games Conf., 2003, pp. 97-108.
    • (2003) Proc. 10th Advances Comput. Games Conf. , pp. 97-108
    • Enzenberger, M.1
  • 25
    • 4344629171 scopus 로고    scopus 로고
    • "An investigation of an evolutionary approach of the opening of Go"
    • G. Kendall, R. Yaakob, and P. Hingston, "An investigation of an evolutionary approach of the opening of Go," in Proc. Congr. Evol. Comput., 2004, pp. 2052-2059.
    • (2004) Proc. Congr. Evol. Comput. , pp. 2052-2059
    • Kendall, G.1    Yaakob, R.2    Hingston, P.3
  • 26
    • 4243659908 scopus 로고
    • Knowledge representation in the many faces of Go
    • Tech. Rep., American Go Association. [Online]. Available: ftp:// bsd-server.ucsfedu/Go/comp/mfg.Z
    • D. Fotland. (1993) Knowledge representation in the many faces of Go. Tech. Rep., American Go Association. [Online]. Available: ftp:// bsd-server.ucsfedu/Go/comp/mfg.Z
    • (1993)
    • Fotland, D.1
  • 27
    • 0036149616 scopus 로고    scopus 로고
    • "Computer go"
    • M. Muller, "Computer go," Artif. Intell., vol. 134, pp. 145-179, 2002.
    • (2002) Artif. Intell. , vol.134 , pp. 145-179
    • Muller, M.1
  • 28
    • 0003582853 scopus 로고    scopus 로고
    • The integration of a priori knowledge into a Go playing neural network
    • [Online]. Available
    • M. Enzenberger. (1996) The integration of a priori knowledge into a Go playing neural network. [Online]. Available: http://www. markus-enzenberger.de/neurogol996.html
    • (1996)
    • Enzenberger, M.1
  • 29
    • 4344712038 scopus 로고    scopus 로고
    • "Solution Concepts in Coevolutionary Algorithms"
    • Ph.D. dissertation, Brandeis Univ., Waltham, MA
    • S. Ficici, "Solution Concepts in Coevolutionary Algorithms," Ph.D. dissertation, Brandeis Univ., Waltham, MA, 2004.
    • (2004)
    • Ficici, S.1
  • 30
    • 0001118093 scopus 로고
    • "Co-evolving parasites improves simulated evolution as an optimization procedure"
    • D. Hillis, "Co-evolving parasites improves simulated evolution as an optimization procedure," Physica D, vol. 42, pp. 228-234, 1990.
    • (1990) Physica D , vol.42 , pp. 228-234
    • Hillis, D.1
  • 32
    • 0023331258 scopus 로고
    • "An introduction to computing with neural nets"
    • Apr.
    • R. Lippmann, "An introduction to computing with neural nets," IEEE ASSP Mag., pp. 4-22, Apr. 1987.
    • (1987) IEEE ASSP Mag. , pp. 4-22
    • Lippmann, R.1
  • 33
    • 84901411850 scopus 로고    scopus 로고
    • "Piece difference: Simple to evolve?"
    • E. Hughes, "Piece difference: Simple to evolve?," in Proc. Congr. Evol. Comput., 2003, pp. 2470-2473.
    • (2003) Proc. Congr. Evol. Comput. , pp. 2470-2473
    • Hughes, E.1
  • 34
    • 4344679259 scopus 로고    scopus 로고
    • "Competitive coevolution through evolutionary complexification"
    • K. Stanley and R. Miikulainen, "Competitive coevolution through evolutionary complexification," J. Artificial Intell. Res., vol. 21, pp. 63-100, 2004.
    • (2004) J. Artificial Intell. Res. , vol.21 , pp. 63-100
    • Stanley, K.1    Miikulainen, R.2
  • 35
    • 0002933170 scopus 로고
    • "Designing neural networks using genetic algorithm with graph generation system"
    • H. Kitano, "Designing neural networks using genetic algorithm with graph generation system," Complex Syst., vol. 4, pp. 461-476, 1990.
    • (1990) Complex Syst. , vol.4 , pp. 461-476
    • Kitano, H.1
  • 36
    • 0011246591 scopus 로고
    • "Cellular encoding of genetic neural networks"
    • Ecole Normale Superieure de Lyon, Laboratoire de l'Informatique du Parallelisme, Tech. Rep
    • F. Gruau, "Cellular encoding of genetic neural networks," Ecole Normale Superieure de Lyon, Laboratoire de l'Informatique du Parallelisme, Tech. Rep. 92-21, 1992.
    • (1992) , pp. 21-92
    • Gruau, F.1
  • 37
    • 0012329219 scopus 로고    scopus 로고
    • "A comparison between cellular encoding and direct encoding for genetic neural networks"
    • Neuro-colt Tech. Rep. NC-TR-96-048
    • F. Gruau, D. Whitley, and L. Pyeatt, "A comparison between cellular encoding and direct encoding for genetic neural networks," Neuro-colt Tech. Rep. NC-TR-96-048, 1996.
    • (1996)
    • Gruau, F.1    Whitley, D.2    Pyeatt, L.3
  • 38
    • 0038145011 scopus 로고    scopus 로고
    • "Temporal difference learning applied to a high-performance game-playing program"
    • J. Schaeffer, M. Hlynka, and V. Jussila, "Temporal difference learning applied to a high-performance game-playing program," in Proc. Int. Joint Conf. Artif. Intell., 2001, pp. 529-534.
    • (2001) Proc. Int. Joint Conf. Artif. Intell. , pp. 529-534
    • Schaeffer, J.1    Hlynka, M.2    Jussila, V.3
  • 40
    • 27844439040 scopus 로고    scopus 로고
    • "AI techniques for the game of Go"
    • Ph.D. dissertation, Maastricht Univ., Maastricht, The Netherlands
    • E. Van Der Werf, "AI techniques for the game of Go," Ph.D. dissertation, Maastricht Univ., Maastricht, The Netherlands, 2005.
    • (2005)
    • Van Der Werf, E.1
  • 41
    • 0035479281 scopus 로고    scopus 로고
    • "Computer Go: An AI-oriented survey"
    • B. Bouzy and I Cazenave, "Computer Go: An AI-oriented survey," Artif. Intell. J., pp. 39-103, 2001.
    • (2001) Artif. Intell. J. , pp. 39-103
    • Bouzy, B.1    Cazenave, I.2
  • 42
    • 0002882372 scopus 로고    scopus 로고
    • "Knightcap: A chess program that learns by combining TD(λ) with game-tree search"
    • [Online]. Available: citeseer.ist.psu.edu/baxter98knightcap.html
    • J. Baxter, A. Trigdell, and L. Weaver, "Knightcap: A chess program that learns by combining TD(λ) with game-tree search," in Proc. 15th Int. Conf. Mach. Learn., 1998, pp. 28-36. [Online]. Available: citeseer.ist.psu.edu/baxter98knightcap.html.
    • (1998) Proc. 15th Int. Conf. Mach. Learn. , pp. 28-36
    • Baxter, J.1    Trigdell, A.2    Weaver, L.3
  • 44
    • 84944327472 scopus 로고    scopus 로고
    • "Reducing random fluctuations in mutative self-adaptation"
    • Berlin, Germany: Springer-Verlag Parallel Problem Solving from Nature VII (PPSN-2002)
    • T. P. Runarsson, "Reducing random fluctuations in mutative self-adaptation," in Lecture Notes in Computer Science. Berlin, Germany: Springer-Verlag, 2002, vol. 2439, Parallel Problem Solving from Nature VII (PPSN-2002), pp. 194-203.
    • (2002) Lecture Notes in Computer Science , vol.2439 , pp. 194-203
    • Runarsson, T.P.1
  • 45
    • 0033362601 scopus 로고    scopus 로고
    • "Evolving artificial neural networks"
    • X. Yao, "Evolving artificial neural networks,"Proc. IEEE, vol. 87, no. 9, pp. 1423-1447, 1999.
    • (1999) Proc. IEEE , vol.87 , Issue.9 , pp. 1423-1447
    • Yao, X.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.