1. L. C. Baird and A. W. Moore. Gradient descent for general reinforcement learning. In M. J. Kearns, S. A. Solla, and D. A. Cohn, editors, Advances in Neural Information Processing Systems, volume 11, pages 968-974. The MIT Press, 1999.
2. R. H. Crites and A. G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1017-1023, Cambridge, MA, 1996. MIT Press.
4. T. Edmunds. An optimum agent for the discrete triathlon. In Reinforcement Learning Benchmarks and Bake-offs II, a workshop at the 2005 NIPS conference, 2005.
8. R. Maclin and J. W. Shavlik. Creating advice-taking reinforcement learners. Machine Learning, 22:251-282, 1996.
10. G. A. Rummery and M. Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Engineering Department, Cambridge University, 1994.
11. S. P. Singh and R. S. Sutton. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158, 1996.
13. P. Stone, G. Kuhlmann, M. E. Taylor, and Y. Liu. Keepaway soccer: From machine learning testbed to benchmark. In I. Noda, A. Jacoff, A. Bredenfeld, and Y. Takahashi, editors, RoboCup 2005: Robot Soccer World Cup IX, volume 4020, pages 93-105. Springer Verlag, Berlin, 2006.
14. P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
15. R. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
17. G. Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.