SCOPUS 정보 검색 플랫폼

AI Magazine

Volumn 31, Issue 2, 2010, Pages 81-94

The reinforcement learning competitions

(3) Whiteson, Shimon a Tanner, Brian b White, Adam b

a UNIVERSITY OF AMSTERDAM (Netherlands)

b UNIVERSITY OF ALBERTA (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT AGENTS; MACHINE LEARNING;

EMPIRICAL EVALUATIONS; EVALUATION FRAMEWORK; REINFORCEMENT LEARNING AGENT; ROBUST LEARNING; SOFTWARE INFRASTRUCTURE;

REINFORCEMENT LEARNING;

EID: 79951878534 PISSN: 07384602 EISSN: None Source Type: Journal
DOI: 10.1609/aimag.v31i2.2227 Document Type: Article

Times cited : (41)

References (23)

1
- 0034859944
- Autonomous helicopter control using reinforcement learning policy search methods
- Piscataway NJ: Institute of Electrical and Electronics Engineers, Inc.
- Bagnell, J., and Schneider, J. 2001. Autonomous Helicopter Control Using Reinforcement Learning Policy Search Methods. In Proceedings of the International Conference on Robotics and Automation 2001, 1615-1620. Piscataway NJ: Institute of Electrical and Electronics Engineers, Inc.
- (2001) Proceedings of the International Conference on Robotics and Automation 2001 , pp. 1615-1620
- Bagnell, J.¹ Schneider, J.²

2
- 0004870746
- A problem in the sequential design of experiments
- Bellman, R. E. 1956. A Problem in the Sequential Design of Experiments. Sankhya 16(3,4): 221-229.
- (1956) Sankhya , vol.16 , Issue.3-4 , pp. 221-229
- Bellman, R.E.¹

3
- 0008812318
- Belmont, MA: Athena Scientific
- Bertsekas, D. P., and Tsitsiklis, J. N. 1996. Neural Dynamic Programming. Belmont, MA: Athena Scientific.
- (1996) Neural Dynamic Programming.
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 85153940465
- Generalization in reinforcement learning: Safely approximating the value function
- Cambridge, MA: The MIT Press
- Boyan, J. A., and Moore, A. W. 1995. Generalization in Reinforcement Learning: Safely Approximating the Value Function. In Advances in Neural Information Processing Systems 7, 369-376. Cambridge, MA: The MIT Press.
- (1995) Advances in Neural Information Processing Systems 7 , pp. 369-376
- Boyan, J.A.¹ Moore, A.W.²

5
- 0028605089
- Swinging up the acrobot: An example of intelligent control
- Piscataway, NJ: Institute of Electrical and Electronics Engineers, Inc.
- Dejong, G., and Spong, M. W. 1994. Swinging Up the Acrobot: An Example of Intelligent Control. In Proceedings of the American Control Conference, 2158-2162. Piscataway, NJ: Institute of Electrical and Electronics Engineers, Inc.
- (1994) Proceedings of the American Control Conference , pp. 2158-2162
- Dejong, G.¹ Spong, M.W.²

6
- 35248818685
- Tetris is hard, even to approximate
- Lecture Notes in Computer Science, Springer
- Demaine, D. E.; Hohenberger, S.; and Liben-Nowell, D. 2003. Tetris Is Hard, Even to Approximate. In Proceedings of the Ninth International Computing and Combinatorics Conference, 351-363. Lecture Notes in Computer Science, Volume 2697. Berlin: Springer.
- (2003) Proceedings of the Ninth International Computing and Combinatorics Conference , vol.2697 , pp. 351-363
- Demaine, D.E.¹ Hohenberger, S.² Liben-Nowell, D.³

7
- 56449093331
- An objectoriented representation for efficient reinforcement learning
- New York: Association for Computing Machinery
- Diuk, C.; Cohen, A.; and Littman, M. 2008. An ObjectOriented Representation for Efficient Reinforcement Learning. In Proceedings of the 25th International Conference on Machine Learning, 240-247. New York: Association for Computing Machinery.
- (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 240-247
- Diuk, C.¹ Cohen, A.² Littman, M.³

8
- 0003722376
- Cambridge, MA: Addison-Wesley
- Goldberg, D. E. 1989. Genetic Algorithms in Search, Optimization, and Machine Learning. Cambridge, MA: Addison-Wesley.
- (1989) Genetic Algorithms in Search, Optimization, and Machine Learning
- Goldberg, D.E.¹

9
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P.; Littman, M. L.; and Moore, A. P. 1996. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research 4: 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.P.³

10
- 72749107057
- Neuroevolutionary reinforcement learning for generalized helicopter control
- New York: Association for Computing Machinery
- Koppejan, R., and Whiteson, S. 2009. Neuroevolutionary Reinforcement Learning for Generalized Helicopter Control. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2009), 145-152. New York: Association for Computing Machinery.
- (2009) Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2009) , pp. 145-152
- Koppejan, R.¹ Whiteson, S.²

11
- 33744488034
- Inverted autonomous helicopter flight via reinforcement learning
- Berlin: Springer
- Ng, A. Y.; Coates, A.; Diel, M.; Ganapathi, V.; Schulte, J.; Tse, B.; Berger, E.; and Liang, E. 2004. Inverted Autonomous Helicopter Flight Via Reinforcement Learning. In Proceedings of the International Symposium on Experimental Robotics, 363-372. Berlin: Springer.
- (2004) Proceedings of the International Symposium on Experimental Robotics , pp. 363-372
- Ng, A.Y.¹ Coates, A.² Diel, M.³ Ganapathi, V.⁴ Schulte, J.⁵ Tse, B.⁶ Berger, E.⁷ Liang, E.⁸

12
- 0032021222
- Soccer server: A tool for research on multiagent systems
- Noda, I.; Matsubara, H.; Hiraki, K.; and Frank, I. 1998. Soccer Server: A Tool for Research on Multiagent Systems. Applied Artificial Intelligence 12(1): 233-250. (Pubitemid 127619180)
- (1998) Applied Artificial Intelligence , vol.12 , Issue.2-3 , pp. 233-250
- Noda, I.¹ Matsubara, H.² Hiraki, K.³ Frank, I.⁴

13
- 79951937255
- A novel benchmark methodology and data repository for real-life reinforcement learning
- New York: Association for Computing Machinery
- Nouri, A.; Littman, M. L.; Li, L.; Parr, R.; Painter-Wakefield, C.; and Taylor, G. 2009. A Novel Benchmark Methodology and Data Repository for Real-Life Reinforcement Learning. In Proceedings of the 26th International Conference on Machine Learning. New York: Association for Computing Machinery.
- (2009) Proceedings of the 26th International Conference on Machine Learning
- Nouri, A.¹ Littman, M.L.² Li, L.³ Parr, R.⁴ Painter-Wakefield, C.⁵ Taylor, G.⁶

14
- 37249034293
- Keepaway soccer: From machine learning testbed to benchmark
- Berlin: Springer Verlag
- Stone, P.; Kuhlmann, G.; Taylor, M. E.; and Liu, Y. 2005. Keepaway Soccer: From Machine Learning Testbed to Benchmark. In Robocup-2005: Robot Soccer World Cup IX, Volume 4020, 93-105. Berlin: Springer Verlag.
- (2005) Robocup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
- Stone, P.¹ Kuhlmann, G.² Taylor, M.E.³ Liu, Y.⁴

15
- 27544506565
- Reinforcement learning in robocup-soccer keepaway
- Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement Learning in Robocup-Soccer Keepaway. Adaptive Behavior 13(3): 165-188.
- (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

16
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- Cambridge, MA: The MIT Press
- Sutton, R. S. 1996. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. In Proceedings of Advances in Neural Information Processing Systems 8, 1038-1044. Cambridge, MA: The MIT Press.
- (1996) Proceedings of Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
- Sutton, R.S.¹

17
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. S. 1988. Learning to Predict by the Methods of Temporal Differences. Machine Learning 3(1): 9-44.
- (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
- Sutton, R.S.¹

18
- 0004102479
- MA: The MIT Press
- Sutton, R. S., and Barto, A. G. 1998. Reinforcement Learning: An Introduction. Cambridge, MA: The MIT Press
- (1998) Reinforcement Learning: An Introduction. Cambridge
- Sutton, R.S.¹ Barto, A.G.²

19
- 33845344721
- Learning tetris using the noisy cross-entropy method
- DOI 10.1162/neco.2006.18.12.2936
- Szita, I., and Lörincz, A. 2006. Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation 18(12): 2936-2941. (Pubitemid 44879147)
- (2006) Neural Computation , vol.18 , Issue.12 , pp. 2936-2941
- Szita, I.¹ Lorincz, A.²

20
- 70449370276
- RL-Glue: Language-independent software for reinforcement-learning experiments
- September
- Tanner, B., and White, A. 2009. RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments. Journal of Machine Learning Research 10 (September): 2133-2136.
- (2009) Journal of Machine Learning Research , vol.10 , pp. 2133-2136
- Tanner, B.¹ White, A.²

21
- 79951880135
- Master's Thesis, Department of Computing, University of Alberta, Edmonton, Alberta, Canada
- White, A. 2006. A Standard Benchmarking System for Reinforcement Learning. Master's Thesis, Department of Computing, University of Alberta, Edmonton, Alberta, Canada.
- (2006) A Standard Benchmarking System for Reinforcement Learning
- White, A.¹

22
- 84869461477
- Generalized domains for empirical evaluations in reinforcement learning
- Paper presented Montreal, Quebec, Canada, 25 March
- Whiteson, S.; Tanner, B.; Taylor, M. E.; and Stone, P. 2009. Generalized Domains for Empirical Evaluations in Reinforcement Learning. Paper presented at the 4th Workshop on Evaluation Methods for Machine Learning, Montreal, Quebec, Canada, 25 March.
- (2009) 4th Workshop on Evaluation Methods for Machine Learning
- Whiteson, S.¹ Tanner, B.² Taylor, M.E.³ Stone, P.⁴

23
- 23044435398
- Dynamic model of the octopus arm. I. biomechanics of the octopus reaching movement
- DOI 10.1152/jn.00684.2004
- Yekutieli, Y.; Sagiv-Zohar, R.; Aharonov, R.; Engel, Y.; Hochner, B.; and Flash, T. 2005. A Dynamic Model of the Octopus Arm. I. Biomechanics of the Octopus Reaching Movement. Journal of Neurophysiology 94(2): 1443-1458. (Pubitemid 41061378)
- (2005) Journal of Neurophysiology , vol.94 , Issue.2 , pp. 1443-1458
- Yekutieli, Y.¹ Sagiv-Zohar, R.² Aharonov, R.³ Engel, Y.⁴ Hochner, B.⁵ Flash, T.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.