메뉴 건너뛰기




Volumn 31, Issue 2, 2010, Pages 81-94

The reinforcement learning competitions

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT AGENTS; MACHINE LEARNING;

EID: 79951878534     PISSN: 07384602     EISSN: None     Source Type: Journal    
DOI: 10.1609/aimag.v31i2.2227     Document Type: Article
Times cited : (41)

References (23)
  • 1
    • 0034859944 scopus 로고    scopus 로고
    • Autonomous helicopter control using reinforcement learning policy search methods
    • Piscataway NJ: Institute of Electrical and Electronics Engineers, Inc.
    • Bagnell, J., and Schneider, J. 2001. Autonomous Helicopter Control Using Reinforcement Learning Policy Search Methods. In Proceedings of the International Conference on Robotics and Automation 2001, 1615-1620. Piscataway NJ: Institute of Electrical and Electronics Engineers, Inc.
    • (2001) Proceedings of the International Conference on Robotics and Automation 2001 , pp. 1615-1620
    • Bagnell, J.1    Schneider, J.2
  • 2
    • 0004870746 scopus 로고
    • A problem in the sequential design of experiments
    • Bellman, R. E. 1956. A Problem in the Sequential Design of Experiments. Sankhya 16(3,4): 221-229.
    • (1956) Sankhya , vol.16 , Issue.3-4 , pp. 221-229
    • Bellman, R.E.1
  • 4
    • 85153940465 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • Cambridge, MA: The MIT Press
    • Boyan, J. A., and Moore, A. W. 1995. Generalization in Reinforcement Learning: Safely Approximating the Value Function. In Advances in Neural Information Processing Systems 7, 369-376. Cambridge, MA: The MIT Press.
    • (1995) Advances in Neural Information Processing Systems 7 , pp. 369-376
    • Boyan, J.A.1    Moore, A.W.2
  • 5
    • 0028605089 scopus 로고
    • Swinging up the acrobot: An example of intelligent control
    • Piscataway, NJ: Institute of Electrical and Electronics Engineers, Inc.
    • Dejong, G., and Spong, M. W. 1994. Swinging Up the Acrobot: An Example of Intelligent Control. In Proceedings of the American Control Conference, 2158-2162. Piscataway, NJ: Institute of Electrical and Electronics Engineers, Inc.
    • (1994) Proceedings of the American Control Conference , pp. 2158-2162
    • Dejong, G.1    Spong, M.W.2
  • 7
    • 56449093331 scopus 로고    scopus 로고
    • An objectoriented representation for efficient reinforcement learning
    • New York: Association for Computing Machinery
    • Diuk, C.; Cohen, A.; and Littman, M. 2008. An ObjectOriented Representation for Efficient Reinforcement Learning. In Proceedings of the 25th International Conference on Machine Learning, 240-247. New York: Association for Computing Machinery.
    • (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 240-247
    • Diuk, C.1    Cohen, A.2    Littman, M.3
  • 12
    • 0032021222 scopus 로고    scopus 로고
    • Soccer server: A tool for research on multiagent systems
    • Noda, I.; Matsubara, H.; Hiraki, K.; and Frank, I. 1998. Soccer Server: A Tool for Research on Multiagent Systems. Applied Artificial Intelligence 12(1): 233-250. (Pubitemid 127619180)
    • (1998) Applied Artificial Intelligence , vol.12 , Issue.2-3 , pp. 233-250
    • Noda, I.1    Matsubara, H.2    Hiraki, K.3    Frank, I.4
  • 14
    • 37249034293 scopus 로고    scopus 로고
    • Keepaway soccer: From machine learning testbed to benchmark
    • Berlin: Springer Verlag
    • Stone, P.; Kuhlmann, G.; Taylor, M. E.; and Liu, Y. 2005. Keepaway Soccer: From Machine Learning Testbed to Benchmark. In Robocup-2005: Robot Soccer World Cup IX, Volume 4020, 93-105. Berlin: Springer Verlag.
    • (2005) Robocup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
    • Stone, P.1    Kuhlmann, G.2    Taylor, M.E.3    Liu, Y.4
  • 15
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning in robocup-soccer keepaway
    • Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement Learning in Robocup-Soccer Keepaway. Adaptive Behavior 13(3): 165-188.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 16
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • Cambridge, MA: The MIT Press
    • Sutton, R. S. 1996. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. In Proceedings of Advances in Neural Information Processing Systems 8, 1038-1044. Cambridge, MA: The MIT Press.
    • (1996) Proceedings of Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 17
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R. S. 1988. Learning to Predict by the Methods of Temporal Differences. Machine Learning 3(1): 9-44.
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.S.1
  • 19
    • 33845344721 scopus 로고    scopus 로고
    • Learning tetris using the noisy cross-entropy method
    • DOI 10.1162/neco.2006.18.12.2936
    • Szita, I., and Lörincz, A. 2006. Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation 18(12): 2936-2941. (Pubitemid 44879147)
    • (2006) Neural Computation , vol.18 , Issue.12 , pp. 2936-2941
    • Szita, I.1    Lorincz, A.2
  • 20
    • 70449370276 scopus 로고    scopus 로고
    • RL-Glue: Language-independent software for reinforcement-learning experiments
    • September
    • Tanner, B., and White, A. 2009. RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments. Journal of Machine Learning Research 10 (September): 2133-2136.
    • (2009) Journal of Machine Learning Research , vol.10 , pp. 2133-2136
    • Tanner, B.1    White, A.2
  • 21
    • 79951880135 scopus 로고    scopus 로고
    • Master's Thesis, Department of Computing, University of Alberta, Edmonton, Alberta, Canada
    • White, A. 2006. A Standard Benchmarking System for Reinforcement Learning. Master's Thesis, Department of Computing, University of Alberta, Edmonton, Alberta, Canada.
    • (2006) A Standard Benchmarking System for Reinforcement Learning
    • White, A.1
  • 22
    • 84869461477 scopus 로고    scopus 로고
    • Generalized domains for empirical evaluations in reinforcement learning
    • Paper presented Montreal, Quebec, Canada, 25 March
    • Whiteson, S.; Tanner, B.; Taylor, M. E.; and Stone, P. 2009. Generalized Domains for Empirical Evaluations in Reinforcement Learning. Paper presented at the 4th Workshop on Evaluation Methods for Machine Learning, Montreal, Quebec, Canada, 25 March.
    • (2009) 4th Workshop on Evaluation Methods for Machine Learning
    • Whiteson, S.1    Tanner, B.2    Taylor, M.E.3    Stone, P.4
  • 23
    • 23044435398 scopus 로고    scopus 로고
    • Dynamic model of the octopus arm. I. biomechanics of the octopus reaching movement
    • DOI 10.1152/jn.00684.2004
    • Yekutieli, Y.; Sagiv-Zohar, R.; Aharonov, R.; Engel, Y.; Hochner, B.; and Flash, T. 2005. A Dynamic Model of the Octopus Arm. I. Biomechanics of the Octopus Reaching Movement. Journal of Neurophysiology 94(2): 1443-1458. (Pubitemid 41061378)
    • (2005) Journal of Neurophysiology , vol.94 , Issue.2 , pp. 1443-1458
    • Yekutieli, Y.1    Sagiv-Zohar, R.2    Aharonov, R.3    Engel, Y.4    Hochner, B.5    Flash, T.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.