SCOPUS Information Search Platform - View Paper

Proceedings of the National Conference on Artificial Intelligence

Volume 1, 2006, Pages 518-523

Sample-efficient evolutionary function approximation for reinforcement learning

(2) Whiteson, Shimon (a); Stone, Peter (a)

(a) University of Texas at Austin (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EVOLUTIONARY FUNCTION APPROXIMATION; FUNCTION APPROXIMATORS; LEARNING PROBLEMS; PARAMETERIZED FUNCTIONS;

APPROXIMATION THEORY; ARTIFICIAL INTELLIGENCE; EVOLUTIONARY ALGORITHMS; FUNCTION EVALUATION; PROBLEM SOLVING;

LEARNING SYSTEMS;

EID: 33750687531
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (7)

References (22)

1
- 0000500817
- Interactions between learning and evolution
- Ackley, D., and Littman, M. 1991. Interactions between learning and evolution. Artificial Life II, SFI Studies in the Sciences of Complexity 10:487-509.
- (1991) Artificial Life II, SFI Studies in the Sciences of Complexity , vol.10 , pp. 487-509
- Ackley, D.¹ Littman, M.²

2
- 84898958374
- Gradient descent for general reinforcement learning
- MIT Press
- Baird, L., and Moore, A. 1999. Gradient descent for general reinforcement learning. In Advances in Neural Information Processing Systems 11. MIT Press.
- (1999) Advances in Neural Information Processing Systems 11
- Baird, L.¹ Moore, A.²

3
- 85151728371
- Residual algorithms: Reinforcement learning with function approximation
- Morgan Kaufmann
- Baird, L. 1995. Residual algorithms: Reinforcement learning with function approximation. In Proceedings of the Twelfth International Conference on Machine Learning, 30-37. Morgan Kaufmann.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 30-37
- Baird, L.¹

4
- 0001410750
- A new factor in evolution
- Baldwin, J. M. 1896. A new factor in evolution. The American Naturalist 30:441-451.
- (1896) The American Naturalist , vol.30 , pp. 441-451
- Baldwin, J.M.¹

5
- 0032208335
- Elevator group control using multiple reinforcement learning agents
- Crites, R. H., and Barto, A. G. 1998. Elevator group control using multiple reinforcement learning agents. Machine Learning 33(2-3):235-262.
- (1998) Machine Learning , vol.33 , Issue.2-3 , pp. 235-262
- Crites, R.H.¹ Barto, A.G.²

6
- 0003722376
- Goldberg, D. E. 1989. Genetic Algorithms in Search, Optimization and Machine Learning.
- (1989) Genetic Algorithms in Search, Optimization and Machine Learning
- Goldberg, D.E.¹

7
- 0000211184
- How learning can guide evolution
- Hinton, G.E., and Nowlan, S. J. 1987. How learning can guide evolution. Complex Systems 1:495-502.
- (1987) Complex Systems , vol.1 , pp. 495-502
- Hinton, G.E.¹ Nowlan, S.J.²

8
- 0037253062
- The vision of autonomic computing
- Kephart, J. O., and Chess, D. M. 2003. The vision of autonomic computing. Computer 36(1):41-50.
- (2003) Computer , vol.36 , Issue.1 , pp. 41-50
- Kephart, J.O.¹ Chess, D.M.²

9
- 9444275934
- Machine learning for fast quadrupedal locomotion
- Kohl, N., and Stone, P. 2004. Machine learning for fast quadrupedal locomotion. In The Nineteenth National Conference on Artificial Intelligence, 611-616.
- (2004) The Nineteenth National Conference on Artificial Intelligence , pp. 611-616
- Kohl, N.¹ Stone, P.²

10
- 4644323293
- Least-squares policy iteration
- Lagoudakis, M. G., and Parr, R. 2003. Least-squares policy iteration. Journal of Machine Learning Research 4(2003):1107-1149.
- (2003) Journal of Machine Learning Research , vol.4 , Issue.2003 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

11
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning, and teaching
- Lin, L.-J. 1992. Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning 8(3-4):293-321.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 293-321
- Lin, L.-J.¹

12
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- Moore, A. W., and Atkeson, C. G. 1993. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning 13(1):103-130.
- (1993) Machine Learning , vol.13 , Issue.1 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

13
- 33646398129
- Neural fitted Q iteration - First experiences with a data efficient neural reinforcement learning method
- Riedmiller, M. 2005. Neural fitted Q iteration - first experiences with a data efficient neural reinforcement learning method. In Proceedings of the Sixteenth European Conference on Machine Learning, 317-328.
- (2005) Proceedings of the Sixteenth European Conference on Machine Learning , pp. 317-328
- Riedmiller, M.¹

14
- 0000646059
- Learning internal representations by error propagation
- Rumelhart, D. E.; Hinton, G. E.; and Williams, R. J. 1986. Learning internal representations by error propagation. In Parallel Distributed Processing. 318-362.
- (1986) Parallel Distributed Processing , pp. 318-362
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

15
- 0036594106
- Evolving neural networks through augmenting topologies
- Stanley, K. O., and Miikkulainen, R. 2002. Evolving neural networks through augmenting topologies. Evolutionary Computation 10(2):99-127.
- (2002) Evolutionary Computation , vol.10 , Issue.2 , pp. 99-127
- Stanley, K.O.¹ Miikkulainen, R.²

16
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., and Barto, A. G. 1998. Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

17
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Sutton, R.; McAllester, D.; Singh, S.; and Mansour, Y. 2000. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, 1057-1063.
- (2000) Advances in Neural Information Processing Systems , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

18
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. 1994. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation 6(2):215-219.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

19
- 4544366889
- Utility functions in autonomic systems
- Walsh, W. E.; Tesauro, G.; Kephart, J. O.; and Das, R. 2004. Utility functions in autonomic systems. In Proceedings of the International Conference on Autonomic Computing, 70-77.
- (2004) Proceedings of the International Conference on Autonomic Computing , pp. 70-77
- Walsh, W.E.¹ Tesauro, G.² Kephart, J.O.³ Das, R.⁴

20
- 0004049893
- Ph.D. Dissertation, King's College, Cambridge
- Watkins, C. 1989. Learning from Delayed Rewards. Ph.D. Dissertation, King's College, Cambridge.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

21
- 33646714634
- Evolutionary function approximation for reinforcement learning
- To appear
- Whiteson, S., and Stone, P. 2006. Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research. To appear.
- (2006) Journal of Machine Learning Research
- Whiteson, S.¹ Stone, P.²

22
- 0033362601
- Evolving artificial neural networks
- Yao, X. 1999. Evolving artificial neural networks. Proceedings of the IEEE 87(9):1423-1447.
- (1999) Proceedings of the IEEE , vol.87 , Issue.9 , pp. 1423-1447
- Yao, X.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.