메뉴 건너뛰기




Volumn 27, Issue 1, 2009, Pages 55-73

Reinforcement learning for robot soccer

Author keywords

Autonomous learning robots; Batch reinforcement learning; Learning mobile robots; Neural control; RoboCup

Indexed keywords

AUTONOMOUS LEARNING ROBOTS; BATCH REINFORCEMENT LEARNING; LEARNING MOBILE ROBOTS; NEURAL CONTROL; ROBOCUP;

EID: 67650996818     PISSN: 09295593     EISSN: None     Source Type: Journal    
DOI: 10.1007/s10514-009-9120-4     Document Type: Article
Times cited : (239)

References (45)
  • 1
    • 0033148990 scopus 로고    scopus 로고
    • Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development
    • M. Asada E. Uchibe K. Hosoda 1999 Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development Artificial Intelligence 110 2 275 292
    • (1999) Artificial Intelligence , vol.110 , Issue.2 , pp. 275-292
    • Asada, M.1    Uchibe, E.2    Hosoda, K.3
  • 4
    • 0003787146 scopus 로고
    • Princeton University Press Princeton
    • Bellman, R. (1957). Dynamic programming. Princeton: Princeton University Press.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 19
    • 33745207959 scopus 로고    scopus 로고
    • Motion estimation of moving objects for autonomous mobile robots
    • M. Lauer S. Lange M. Riedmiller 2006 Motion estimation of moving objects for autonomous mobile robots Kunstliche Intelligenz 20 1 11 17
    • (2006) Kunstliche Intelligenz , vol.20 , Issue.1 , pp. 11-17
    • Lauer, M.1    Lange, S.2    Riedmiller, M.3
  • 21
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • L. Lin 1992 Self-improving reactive agents based on reinforcement learning, planning and teaching Machine Learning 8 3 293 321
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 293-321
    • Lin, L.1
  • 25
  • 26
    • 4544333988 scopus 로고    scopus 로고
    • Reinforcement learning of humanoid rhythmic walking parameters based on visual information
    • M. Ogino Y. Katoh M. Aono M. Asada K. Hosoda 2004 Reinforcement learning of humanoid rhythmic walking parameters based on visual information Advanced Robotics 18 7 677 697
    • (2004) Advanced Robotics , vol.18 , Issue.7 , pp. 677-697
    • Ogino, M.1    Katoh, Y.2    Aono, M.3    Asada, M.4    Hosoda, K.5
  • 29
    • 38649095925 scopus 로고    scopus 로고
    • Learning to control in operational space
    • DOI 10.1177/0278364907087548
    • J. Peters S. Schaal 2008 Learning to control in operational space The International Journal of Robotics Research 27 2 197 212 (Pubitemid 351169714)
    • (2008) International Journal of Robotics Research , vol.27 , Issue.2 , pp. 197-212
    • Peters, J.1    Schaal, S.2
  • 30
    • 44949241322 scopus 로고    scopus 로고
    • Reinforcement learning of motor skills with policy gradients
    • J. Peters S. Schaal 2008 Reinforcement learning of motor skills with policy gradients Neural Networks 21 4 682 697
    • (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
    • Peters, J.1    Schaal, S.2
  • 34
    • 84943274699 scopus 로고
    • Direct adaptive method for faster backpropagation learning: The RPROP algorithm
    • Riedmiller, M., & Braun, H., (1993). A direct adaptive method for faster backpropagation learning: the RPROP algorithm. In H. Ruspini (Ed.), Proceedings of the IEEE international conference on neural networks (ICNN) (pp. 586-591), San Francisco. (Pubitemid 23662229)
    • (1993) 1993 IEEE International Conference on Neural Networks , pp. 586-591
    • Riedmiller Martin1    Braun Heinrich2
  • 38
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • P. Stone R. Sutton G. Kuhlmann 2005 Reinforcement learning for RoboCup-soccer keepaway Adaptive Behavior 13 3 165 188
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.2    Kuhlmann, G.3
  • 42
    • 0024702037 scopus 로고
    • A parallel network that learns to play backgammon
    • G. Tesauro T. Sejnowski 1989 A parallel network that learns to play backgammon Artificial Intelligence 39 3 357 390
    • (1989) Artificial Intelligence , vol.39 , Issue.3 , pp. 357-390
    • Tesauro, G.1    Sejnowski, T.2
  • 43
    • 3342953146 scopus 로고    scopus 로고
    • Real-time object tracking for soccer-robots without color information
    • A. Treptow A. Zell 2004 Real-time object tracking for soccer-robots without color information Robotics and Autonomous Systems 48 1 41 48
    • (2004) Robotics and Autonomous Systems , vol.48 , Issue.1 , pp. 41-48
    • Treptow, A.1    Zell, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.