SCOPUS 정보 검색 플랫폼

Volumn 15, Issue 1, 2007, Pages 33-50

Empirical studies in action selection with reinforcement learning

(3) Whiteson, Shimon a Taylor, Matthew E a Stone, Peter a

a University of Texas at Austin (United States)

Author keywords

Autonomic computing; Evolutionary computation; Neural networks; Reinforcement learning; Robot soccer; Temporal difference methods

Indexed keywords

EID: 33847264400 PISSN: 10597123 EISSN: 17412633 Source Type: Journal
DOI: 10.1177/1059712306076253 Document Type: Article

Times cited : (40)

References (60)

1
- 0000500817
- Interactions between learning and evolution
- Ackley, D., & Littman, M. (1991). Interactions between learning and evolution. Artificial Life II, SFI Studies in the Sciences of Complexity, 10, 487-509.
- (1991) Artificial Life II, SFI Studies in the Sciences of Complexity , vol.10 , pp. 487-509
- Ackley, D.¹ Littman, M.²

2
- 0036568025
- Finite-time analysis of the multi-armed bandit problem
- Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47, 235-256.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 84898958374
- Gradient descent for general reinforcement learning
- M. S. Kearns, S. A. Solla, & D. A. Cohn (Eds.), Cambridge, MA: MIT Press.
- Baird, L., & Moore, A. (1999). Gradient descent for general reinforcement learning. In M. S. Kearns, S. A. Solla, & D. A. Cohn (Eds.), Advances in neural information processing systems 11. Cambridge, MA: MIT Press.
- (1999) Advances in neural information processing systems 11
- Baird, L.¹ Moore, A.²

4
- 0001410750
- A new factor in evolution
- Baldwin, J. M. (1896). A new factor in evolution. The American Naturalist, 30, 441-451.
- (1896) The American Naturalist , vol.30 , pp. 441-451
- Baldwin, J.M.¹

5
- 21044455866
- Threshold selection, hypothesis tests and DOE methods
- Beielstein, T., & Markon, S. (2002). Threshold selection, hypothesis tests and DOE methods. In Proceedings of the 2002 World Congress on Evolutionary Computation (pp. 777-782). Honolulu, HI.
- Proceedings of the 2002 World Congress on Evolutionary Computation , pp. 777-782
- Beielstein, T.¹ Markon, S.²

6
- 0004870746
- A problem in the sequential design of experiments
- Bellman, R. E. (1956). A problem in the sequential design of experiments. Sankhya, 16, 221-229.
- (1956) Sankhya , vol.16 , pp. 221-229
- Bellman, R.E.¹

7
- 29344432652
- Boers, E., Borst, M., & Sprinkhuizen-Kuyper, I. (1995). Evolving artificial neural networks using the "Baldwin Effect". Technical Report TR 95-14.
- (1995) Evolving artificial neural networks using the "Baldwin Effect"
- Boers, E.¹ Borst, M.² Sprinkhuizen-Kuyper, I.³

8
- 0001133021
- Generalization in reinforcement learning: Safely approximating the value function
- G. Tesauro, D. S. Touretzky, & T. K. Leen (Eds.), Cambridge, MA: MIT Press.
- Boyan, J. A., & Moore, A. W. (1995). Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, & T. K. Leen (Eds.), Advances in neural information processing systems 7. Cambridge, MA: MIT Press.
- (1995) Advances in neural information processing systems 7
- Boyan, J.A.¹ Moore, A.W.²

9
- 0032208335
- Elevator group control using multiple reinforcement learning agents
- Crites, R. H., & Barto, A. G. (1998). Elevator group control using multiple reinforcement learning agents. Machine Learning, 33, 235-262.
- (1998) Machine Learning , vol.33 , pp. 235-262
- Crites, R.H.¹ Barto, A.G.²

10
- 1542329500
- Reinforced genetic programming
- Downing, K. L. (2001). Reinforced genetic programming. Genetic Programming and Evolvable Machines, 2, 259-288.
- (2001) Genetic Programming and Evolvable Machines , vol.2 , pp. 259-288
- Downing, K.L.¹

11
- 0006054388
- Unifying learning with evolution through Baldwinian evolution and Lamarckism: A case study
- Giraud-Carrier, C. (2000). Unifying learning with evolution through Baldwinian evolution and Lamarckism: A case study. In Proceedings of the Symposium on Computational Intelligence and Learning (CoIL-2000) (pp. 36-41). Chios, Greece.
- Proceedings of the Symposium on Computational Intelligence and Learning (CoIL-2000) , pp. 36-41
- Giraud-Carrier, C.¹

12
- 21244457900
- Gomez, F., & Miikkulainen, R. (2002). Robust nonlinear control through neuroevolution. Technical Report AI02-292.
- (2002) Robust nonlinear control through neuroevolution
- Gomez, F.¹ Miikkulainen, R.²

13
- 32444448207
- Co-evolving recurrent neurons learn deep memory POMDPs
- Gomez, F., & Schmidhuber, J. (2005). Co-evolving recurrent neurons learn deep memory POMDPs. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005) (pp. 491-498). Washington, DC.
- Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005) , pp. 491-498
- Gomez, F.¹ Schmidhuber, J.²

14
- 0000211184
- How learning can guide evolution
- Hinton, G. E., & Nowlan, S. J. (1987). How learning can guide evolution. Complex Systems, 1, 495-502.
- (1987) Complex Systems , vol.1 , pp. 495-502
- Hinton, G.E.¹ Nowlan, S.J.²

15
- 0003463297
- Ann Arbor, MI: University of Michigan Press.
- Holland, J. H. (1975). Adaptation in natural and artificial systems: An introductory analysis with applications to biology, control and artificial intelligence. Ann Arbor, MI: University of Michigan Press.
- (1975) Adaptation in natural and artificial systems: An introductory analysis with applications to biology, control and artificial intelligence
- Holland, J.H.¹

16
- 4344663737
- Genetic programming and multi-agent layered learning by reinforcements
- Hsu, W. H., & Gustafson, S. M. (2002). Genetic programming and multi-agent layered learning by reinforcements. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2002) (pp. 764-771). New York, NY.
- Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2002) , pp. 764-771
- Hsu, W.H.¹ Gustafson, S.M.²

17
- 0004280606
- Cambridge, MA: MIT Press.
- Kaelbling, L. P. (1993). Learning in embedded systems. Cambridge, MA: MIT Press.
- (1993) Learning in embedded systems
- Kaelbling, L.P.¹

18
- 0037253062
- The vision of autonomic computing
- Kephart, J. O., & Chess, D. M. (2003). The vision of autonomic computing. Computer, 36, 41-50.
- (2003) Computer , vol.36 , pp. 41-50
- Kephart, J.O.¹ Chess, D.M.²

19
- 9444275934
- Machine learning for fast quadru-pedal locomotion
- Kohl, N., & Stone, P. (2004). Machine learning for fast quadru-pedal locomotion. In Proceedings of the 19th National Conference on Artificial Intelligence (pp. 611-616). San Jose, CA.
- Proceedings of the 19th National Conference on Artificial Intelligence , pp. 611-616
- Kohl, N.¹ Stone, P.²

20
- 84898938510
- Actor-critic algorithms
- M. S. Kearns, S. A. Solla, & D. A. Cohn (Eds.), Cambridge, MA: MIT Press.
- Konda, V. R., & Tsitsiklis, J. N. (1999). Actor-critic algorithms. In M. S. Kearns, S. A. Solla, & D. A. Cohn (Eds.), Advances in neural information processing systems 11 (pp. 1008-1014). Cambridge, MA: MIT Press.
- (1999) Advances in neural information processing systems 11 , pp. 1008-1014
- Konda, V.R.¹ Tsitsiklis, J.N.²

21
- 0035558808
- KaBaGe-RL: Kanerva-based generalisation and reinforcement learning for possession football
- Kostiadis, K., & Hu, H. (2001). KaBaGe-RL: Kanerva-based generalisation and reinforcement learning for possession football. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2001). Maui, HI.
- Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2001)
- Kostiadis, K.¹ Hu, H.²

22
- 0030721089
- Comparison of CMACs and radial basis functions for local function approximators in reinforcement learning
- Kretchmar, R. M., & Anderson, C. W. (1997). Comparison of CMACs and radial basis functions for local function approximators in reinforcement learning. In Proceedings of the International Conference on Neural Networks. Houston, TX.
- Proceedings of the International Conference on Neural Networks
- Kretchmar, R.M.¹ Anderson, C.W.²

23
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning, and teaching
- Lin, L.-J. (1992). Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning, 8, 293-321.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

24
- 0032035573
- Bandit problems and the exploration/exploitation tradeoff
- Macready, W. G., & Wolpert, D. H. (1998). Bandit problems and the exploration/exploitation tradeoff. IEEE Transactions on Evolutionary Computation, 2, 2-22.
- (1998) IEEE Transactions on Evolutionary Computation , vol.2 , pp. 2-22
- Macready, W.G.¹ Wolpert, D.H.²

25
- 29344433509
- Samuel meets Amarel: Automating value function approximation using global state space analysis
- Mahadevan, S. (2005). Samuel meets Amarel: Automating value function approximation using global state space analysis. In Proceedings of the 20th National Conference on Artificial Intelligence. Pittsburgh, PA.
- Proceedings of the 20th National Conference on Artificial Intelligence
- Mahadevan, S.¹

26
- 32344453992
- Culling and teaching in neuro-evolution
- T. Bäck (Ed.)
- McQuesten, P., & Miikkulainen, R. (1997). Culling and teaching in neuro-evolution. In T. Bäck (Ed.), Proceedings of the 7th International Conference on Genetic Algorithms (pp. 760-767). East Lansing, MI.
- Proceedings of the 7th International Conference on Genetic Algorithms , pp. 760-767
- McQuesten, P.¹ Miikkulainen, R.²

27
- 0002318273
- Efficient reinforcement learning through symbiotic evolution
- Moriarty, D. E., & Miikkulainen, R. (1996). Efficient reinforcement learning through symbiotic evolution. Machine Learning, 22, 11-32.
- (1996) Machine Learning , vol.22 , pp. 11-32
- Moriarty, D.E.¹ Miikkulainen, R.²

28
- 0004156494
- Evolutionary algorithms for reinforcement learning
- Moriarty, D. E., Schultz, A. C., & Grefenstette, J. J. (1999). Evolutionary algorithms for reinforcement learning. Journal of Artificial Intelligence Research, 11, 241-276.
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 241-276
- Moriarty, D.E.¹ Schultz, A.C.² Grefenstette, J.J.³

29
- 0141819580
- PEGASUS: A policy search method for large MDPs and POMDPs
- Ng, A. Y., & Jordan, M. I. (2000). PEGASUS: A policy search method for large MDPs and POMDPs. In Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (pp. 406-415). San Mateo, CA: Morgan Kaufmann Publishers.
- Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence , pp. 406-415
- Ng, A.Y.¹ Jordan, M.I.²

30
- 0002898235
- Learning and evolution in neural networks
- Nolfi, S., Elman, J. L., & Parisi, D. (1994). Learning and evolution in neural networks. Adaptive Behavior, 2, 5-28.
- (1994) Adaptive Behavior , vol.2 , pp. 5-28
- Nolfi, S.¹ Elman, J.L.² Parisi, D.³

31
- 4344613905
- Coevolution of a backgammon player
- C. G. Langton & K. Shimohara (Eds.), Cambridge, MA: MIT Press.
- Pollack, J. B., Blair, A. D., & Land, M. (1997). Coevolution of a backgammon player. In C. G. Langton & K. Shimohara (Eds.), Artificial Life V: Proceedings of the 5th International Workshop on the Synthesis and Simulation of Living Systems (pp. 92-98). Cambridge, MA: MIT Press.
- (1997) Artificial Life V: Proceedings of the 5th International Workshop on the Synthesis and Simulation of Living Systems , pp. 92-98
- Pollack, J.B.¹ Blair, A.D.² Land, M.³

32
- 0001355838
- Radial basis functions for multivariate interpolation: A review
- J. C. Mason & M. G. Cox (Eds.), Oxford: Clarendon Press.
- Powell, M. J. D. (1987). Radial basis functions for multivariate interpolation: A review. In J. C. Mason & M. G. Cox (Eds.), Algorithms for Approximation (pp. 143-167). Oxford: Clarendon Press.
- (1987) Algorithms for Approximation , pp. 143-167
- Powell, M.J.D.¹

33
- 33847276511
- Modelling natural action selection: An introduction to the theme issue
- Prescott, T. J., Bryson, J. J., & Seth, A. K. (in press). Modelling natural action selection: An introduction to the theme issue. Philosophical Transactions of the Royal Society B: Biological Sciences.
- Philosophical Transactions of the Royal Society B: Biological Sciences
- Prescott, T.J.¹ Bryson, J.J.² Seth, A.K.³

34
- 33646712258
- Decision tree function approximation in reinforcement learning
- Pyeatt, L. D., & Howe, A. E. (2001). Decision tree function approximation in reinforcement learning. In Proceedings of the 3rd International Symposium on Adaptive Systems: Evolutionary Computation and Probabilistic Graphical Models (pp. 70-77). Havana, Cuba.
- Proceedings of the 3rd International Symposium on Adaptive Systems: Evolutionary Computation and Probabilistic Graphical Models , pp. 70-77
- Pyeatt, L.D.¹ Howe, A.E.²

35
- 33646398129
- Neural fitted Q iteration - First experiences with a data efficient neural reinforcement learning method
- Reidmiller, M. (2005). Neural fitted Q iteration - first experiences with a data efficient neural reinforcement learning method. In Proceedings of the 16th European Conference on Machine Learning (pp. 317-328). Porto, Portugal.
- Proceedings of the 16th European Conference on Machine Learning , pp. 317-328
- Reidmiller, M.¹

36
- 1942516829
- Combining TD-learning with cascade-correlation networks
- Rivest, F., & Precup, D. (2003). Combining TD-learning with cascade-correlation networks. In Proceedings of the 20th International Conference on Machine Learning (pp. 632-639). Washington, DC. Menlo Park, CA: AAAI Press.
- Proceedings of the 20th International Conference on Machine Learning , pp. 632-639
- Rivest, F.¹ Precup, D.²

37
- 0003636089
- Engineering Department, Cambridge University.
- Rummery, G. A., & Niranjan, M. (1994). On-line Q-learning using connectionist systems. Technical report CUED/FINFENG-RT 116, Engineering Department, Cambridge University.
- (1994) On-line Q-learning using connectionist systems
- Rummery, G.A.¹ Niranjan, M.²

38
- 29244474089
- Co-evolution versus self-play temporal difference learning for acquiring position evaluation in small-board go
- Runarsson, T. P., & Lucas, S. M. (2005). Co-evolution versus self-play temporal difference learning for acquiring position evaluation in small-board go. IEEE Transactions on Evolutionary Computation, 9, 628-640.
- (2005) IEEE Transactions on Evolutionary Computation , vol.9 , pp. 628-640
- Runarsson, T.P.¹ Lucas, S.M.²

39
- 26944466214
- Function approximation via tile coding: Automating parameter choice
- J.-D. Zucker & I. Saitta (Eds.)
- Sherstov, A. A., & Stone, P. (2005). Function approximation via tile coding: Automating parameter choice. In J.-D. Zucker & I. Saitta (Eds.), Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2005), Lecture Notes in Artificial Intelligence (Vol. 3607, pp. 194-205). Berlin: Springer-Verlag.
- Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2005), Lecture Notes in Artificial Intelligence , pp. 194-205
- Sherstov, A.A.¹ Stone, P.²

40
- 0029753630
- Reinforcement learning with replacing eligibility traces
- Singh, S. P., & Sutton, R. S. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22, 123-158.
- (1996) Machine Learning , vol.22 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

41
- 0001898381
- Practical reinforcement learning in continuous spaces
- Smart, W. D., & Kaelbling, L. P. (2000). Practical reinforcement learning in continuous spaces. In Proceedings of the 17th International Conference on Machine Learning (pp. 903-910). Stanford University, CA.
- Proceedings of the 17th International Conference on Machine Learning , pp. 903-910
- Smart, W.D.¹ Kaelbling, L.P.²

42
- 84878524995
- Averaging efficiently in the presence of noise
- Amsterdam, the Netherlands.
- Stagge, P. (1998). Averaging efficiently in the presence of noise. In Parallel problem solving from nature (Vol. 5, pp. 188-197). Amsterdam, the Netherlands.
- (1998) Parallel problem solving from nature , pp. 188-197
- Stagge, P.¹

43
- 84901454220
- Evolving adaptive neural networks with and without adaptive synapses
- Stanley, K. O., Bryant, B. D., & Miikkulainen, R. (2003). Evolving adaptive neural networks with and without adaptive synapses. In Proceedings of the 2003 Congress on Evolutionary Computation (CEC 2003) (Vol. 4, pp. 2557-2564). Canberra, Australia.
- Proceedings of the 2003 Congress on Evolutionary Computation (CEC 2003) , pp. 2557-2564
- Stanley, K.O.¹ Bryant, B.D.² Miikkulainen, R.³

44
- 0036594106
- Evolving neural networks through augmenting topologies
- Stanley, K. O., & Miikkulainen, R. (2002). Evolving neural networks through augmenting topologies. Evolutionary Computation, 10, 99-127.
- (2002) Evolutionary Computation , vol.10 , pp. 99-127
- Stanley, K.O.¹ Miikkulainen, R.²

45
- 4344679259
- Competitive coevolution through evolutionary complexification
- Stanley, K. O., & Miikkulainen, R. (2004a). Competitive coevolution through evolutionary complexification. Journal of Artificial Intelligence Research, 21, 63-100.
- (2004) Journal of Artificial Intelligence Research , vol.21 , pp. 63-100
- Stanley, K.O.¹ Miikkulainen, R.²

46
- 29244444768
- Evolving a roving eye for go
- Stanley, K. O., & Miikkulainen, R. (2004b). Evolving a roving eye for go. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004). Seattle, WA.
- Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004)
- Stanley, K.O.¹ Miikkulainen, R.²

47
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- Stone, P., Sutton, R. S., & Kuhlmann, G. (2005). Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13, 165-188.
- (2005) Adaptive Behavior , vol.13 , pp. 165-188
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

48
- 37249034293
- Keepaway soccer: From machine learning testbed to benchmark
- I. Noda, A. Jacoff, A. Bredenfeld, & Y. Takahashi (Eds.), Berlin: Springer-Verlag.
- Stone, P., Kuhlmann, G., Taylor, M. E., & Liu, Y. (2006). Keepaway soccer: From machine learning testbed to benchmark. In I. Noda, A. Jacoff, A. Bredenfeld, & Y. Takahashi (Eds.), RoboCup-2005: Robot Soccer World Cup IX (Vol. 4020, pp. 93-105). Berlin: Springer-Verlag.
- (2006) RoboCup-2005: Robot Soccer World Cup IX , pp. 93-105
- Stone, P.¹ Kuhlmann, G.² Taylor, M.E.³ Liu, Y.⁴

49
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.¹

50
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- S. A. Solla, T. K. Leen, & K.-R. Muller (Eds.), Cambridge, MA: MIT Press.
- Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. In S. A. Solla, T. K. Leen, & K.-R. Muller (Eds.), Advances in neural information processing systems (Vol. 12, pp. 1057-1063). Cambridge, MA: MIT Press.
- (2000) Advances in neural information processing systems , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

51
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Cambridge, MA: MIT Press.
- Sutton, R. S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Advances in neural information processing systems 8 (pp. 1038-1044). Cambridge, MA: MIT Press.
- (1996) Advances in neural information processing systems 8 , pp. 1038-1044
- Sutton, R.S.¹

52
- 0004102479
- Cambridge, MA: MIT Press.
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

53
- 33750259111
- Comparing evolutionary and temporal difference methods in a reinforcement learning domain
- Taylor, M. E., Whiteson, S., & Stone, P. (2006). Comparing evolutionary and temporal difference methods in a reinforcement learning domain. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2006) (pp. 1321-1328). Seattle, WA.
- Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2006) , pp. 1321-1328
- Taylor, M.E.¹ Whiteson, S.² Stone, P.³

54
- 4544366889
- Utility functions in autonomic systems
- Walsh, W. E., Tesauro, G., Kephart, J. O., & Das, R. (2004). Utility functions in autonomic systems. In Proceedings of the International Conference on Autonomic Computing (pp. 70-77). New York, NY.
- Proceedings of the International Conference on Autonomic Computing , pp. 70-77
- Walsh, W.E.¹ Tesauro, G.² Kephart, J.O.³ Das, R.⁴

55
- 0004049893
- Ph.D. Thesis, King's College, Cambridge
- Watkins, C. (1989). Learning from Delayed Rewards. Ph.D. Thesis, King's College, Cambridge.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

56
- 33646714634
- Evolutionary function approximation for reinforcement learning
- Whiteson, S., & Stone, P. (2006a). Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 7, 877-917.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 877-917
- Whiteson, S.¹ Stone, P.²

57
- 33750687531
- Sample-efficient evolutionary function approximation for reinforcement learning
- Whiteson, S., & Stone, P. (2006b). Sample-efficient evolutionary function approximation for reinforcement learning. In Proceedings of the 21st National Conference on Artificial Intelligence (AAAI 2006) (pp. 518-523). Boston, MA.
- Proceedings of the 21st National Conference on Artificial Intelligence (AAAI 2006) , pp. 518-523
- Whiteson, S.¹ Stone, P.²

58
- 21244469857
- Evolving keepaway soccer players through task decomposition
- Whiteson, S., Kohl, N., Miikkulainen, R., & Stone, P. (2005). Evolving keepaway soccer players through task decomposition. Machine Learning, 59, 5-30.
- (2005) Machine Learning , vol.59 , pp. 5-30
- Whiteson, S.¹ Kohl, N.² Miikkulainen, R.³ Stone, P.⁴

59
- 0027701513
- Genetic reinforcement learning for neurocontrol problems
- Whitley, D., Dominic, S., Das, R., & Anderson, C. W. (1993). Genetic reinforcement learning for neurocontrol problems. Machine Learning, 13, 259-284.
- (1993) Machine Learning , vol.13 , pp. 259-284
- Whitley, D.¹ Dominic, S.² Das, R.³ Anderson, C.W.⁴

60
- 0033362601
- Evolving artificial neural networks
- Yao, X. (1999). Evolving artificial neural networks. Proceedings of the IEEE, 87, 1423-1447.
- (1999) Proceedings of the IEEE , vol.87 , pp. 1423-1447
- Yao, X.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.