Volume , Issue , 2007, Pages 662-669

Batch reinforcement learning in a complex domain

Author keywords

Evolution and adaptation; Learning; Perception and action

Indexed keywords

Agent-based; Algorithmic variants; Asymptotic performance; Batch methods; Batch reinforcement learning; Complex domains; Dimensional controls; Empirical performance; Evolution and adaptation; Experience data; Keepaway; Learning; Multi-agents; On-line algorithms; Perception and action; Pole balancing; RoboCup soccer; Sample complexity; Temporal difference reinforcement learning

EID: 60349130974     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1329125.1329241     Document Type: Conference Paper
Times cited: 44

References (16)
  • 2
    • D. Ernst, P. Geurts, and L. Wehenkel. Tree-based batch mode reinforcement learning. J. Mach. Learn. Res., 6:503-556, 2005.
  • 5
    • L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8:293-321, 1992.
  • 7
    • A. Y. Ng, H. J. Kim, M. I. Jordan, and S. Sastry. Autonomous helicopter flight via reinforcement learning. In S. Thrun, L. Saul, and B. Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
  • 8
    • M. Riedmiller. Neural fitted Q iteration - first experiences with a data efficient neural reinforcement learning method. In J. Gama, R. Camacho, P. Brazdil, A. Jorge, and L. Torgo, editors, ECML, volume 3720 of Lecture Notes in Computer Science, pages 317-328. Springer, 2005.
  • 9
    • G. A. Rummery and M. Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Cambridge University Engineering Department, 1994.
  • 11
    • P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
  • 13
    • R. S. Sutton, D. Precup, and S. P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
  • 15
    • M. E. Taylor and P. Stone. Behavior transfer for value-function-based reinforcement learning. In F. Dignum, V. Dignum, S. Koenig, S. Kraus, M. P. Singh, and M. Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.