SCOPUS 정보 검색 플랫폼

Autonomous Robots

Volumn 27, Issue 1, 2009, Pages 55-73

Reinforcement learning for robot soccer

(4) Riedmiller, Martin a Gabel, Thomas a Hafner, Roland a Lange, Sascha a

a UNIVERSITY OF FREIBURG (Germany)

Author keywords

Autonomous learning robots; Batch reinforcement learning; Learning mobile robots; Neural control; RoboCup

Indexed keywords

AUTONOMOUS LEARNING ROBOTS; BATCH REINFORCEMENT LEARNING; LEARNING MOBILE ROBOTS; NEURAL CONTROL; ROBOCUP;

MOBILE ROBOTS; NAVIGATION; PATTERN RECOGNITION SYSTEMS; REINFORCEMENT; REINFORCEMENT LEARNING;

EDUCATION;

EID: 67650996818 PISSN: 09295593 EISSN: None Source Type: Journal
DOI: 10.1007/s10514-009-9120-4 Document Type: Article

Times cited : (239)

References (45)

1
- 0033148990
- Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development
- M. Asada E. Uchibe K. Hosoda 1999 Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development Artificial Intelligence 110 2 275 292
- (1999) Artificial Intelligence , vol.110 , Issue.2 , pp. 275-292
- Asada, M.¹ Uchibe, E.² Hosoda, K.³

2
- 0034859944
- Autonomous helicopter control using reinforcement learning policy search methods
- IEEE Press New York
- Bagnell, J., & Schneider, J. (2001). Autonomous helicopter control using reinforcement learning policy search methods. In Proceedings of the 2001 IEEE international conference on robotics and automation (ICRA 2001) (pp. 1615-1620), Seoul, South Korea. New York: IEEE Press.
- (2001) Proceedings of the 2001 IEEE International Conference on Robotics and Automation (ICRA 2001) Seoul, South Korea , pp. 1615-1620
- Bagnell, J.¹ Schneider, J.²

3
- 33845878062
- Predicting away robot control latency
- Springer Berlin
- Behnke, S., Egorova, A., Gloye, A., Rojas, R., & Simon, M. (2003). Predicting away robot control latency. In D. Polani, B. Browning, A. Bonarini, & K. Yoshida (Eds.), LNCS. RoboCup 2003: robot soccer world cup VII (pp. 712-719), Padua, Italy. Berlin: Springer.
- (2003) RoboCup 2003: Robot Soccer World Cup VII Padua, Italy LNCS , pp. 712-719
- Behnke, S.¹ Egorova, A.² Gloye, A.³ Rojas, R.⁴ Simon, M.⁵ Polani, D.⁶ Browning, B.⁷ Bonarini, A.⁸ Yoshida, K.⁹

4
- 0003787146
- Princeton University Press Princeton
- Bellman, R. (1957). Dynamic programming. Princeton: Princeton University Press.
- (1957) Dynamic Programming
- Bellman, R.¹

5
- 0003487482
- Athena Scientific Belmont
- Bertsekas, D., & Tsitsiklis, J. (1996). Neuro dynamic programming. Belmont: Athena Scientific.
- (1996) Neuro Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

6
- 14044273044
- An evolutionary approach to gait learning for four-legged robots
- IEEE Press New York
- Chernova, S., & Veloso, M. (2004). An evolutionary approach to gait learning for four-legged robots. In Proceedings of the 2004 IEEE/RSJ international conference on intelligent robots and systems (IROS 2004), Sendai, Japan. New York: IEEE Press.
- (2004) Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2004) Sendai, Japan
- Chernova, S.¹ Veloso, M.²

7
- 85156187730
- Improving elevator performance using reinforcement learning
- MIT Press Cambridge
- Crites, R., & Barto, A. (1995). Improving elevator performance using reinforcement learning. In Advances in neural information processing systems 8 (NIPS 1995) (pp. 1017-1023), Denver, USA. Cambridge: MIT Press.
- (1995) Advances in Neural Information Processing Systems 8 (NIPS 1995) Denver, USA , pp. 1017-1023
- Crites, R.¹ Barto, A.²

8
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst P. Geurts L. Wehenkel 2006 Tree-based batch mode reinforcement learning Journal of Machine Learning Research 6 1 503 556
- (2006) Journal of Machine Learning Research , vol.6 , Issue.1 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

9
- 79958814014
- Adaptive reactive job-shop scheduling with learning agents
- Gabel, T., & Riedmiller, M. (2007). Adaptive reactive job-shop scheduling with learning agents. International Journal of Information Technology and Intelligent Computing, 2(4).
- (2007) International Journal of Information Technology and Intelligent Computing , vol.2 , Issue.4
- Gabel, T.¹ Riedmiller, M.²

10
- 34548729592
- Bridging the gap: Learning in the RoboCup simulation and midsize league
- Porto, Portugal
- Gabel, T., Hafner, R., Lange, S., Lauer, M., & Riedmiller, M. (2006). Bridging the gap: learning in the RoboCup simulation and midsize league. In Proceedings of the 7th Portuguese conference on automatic control (Controlo 2006), Porto, Portugal.
- (2006) Proceedings of the 7th Portuguese Conference on Automatic Control (Controlo 2006)
- Gabel, T.¹ Hafner, R.² Lange, S.³ Lauer, M.⁴ Riedmiller, M.⁵

11
- 70349314519
- A case study on improving defense behavior in soccer simulation 2D: The NeuroHassle approach
- Springer Berlin
- Gabel, T., Riedmiller, M., & Trost, F. (2008). A case study on improving defense behavior in soccer simulation 2D: the NeuroHassle approach. In Iocchi, L., Matsubara, H., Weitzenfeld, A., & Zhou, C. (Eds.), LNCS. RoboCup 2008: robot soccer world cup XII, Suzhou, China. Berlin: Springer.
- (2008) LNCS. RoboCup 2008: Robot Soccer World Cup XII Suzhou, China
- Gabel, T.¹ Riedmiller, M.² Trost, F.³ Iocchi, L.⁴ Matsubara, H.⁵ Weitzenfeld, A.⁶ Zhou, C.⁷

12
- 84880694195
- Stable function approximation in dynamic programming
- Morgan Kaufmann San Mateo
- Gordon, G., Prieditis, A., & Russell, S. (1995). Stable function approximation in dynamic programming. In Proceedings of the twelfth international conference on machine learning (ICML 1995) (pp. 261-268), Tahoe City, USA. San Mateo: Morgan Kaufmann.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning (ICML 1995) Tahoe City, USA , pp. 261-268
- Gordon, G.¹ Prieditis, A.² Russell, S.³

13
- 36348930983
- Neural reinforcement learning controllers for a real robot application
- IEEE Press New York
- Hafner, R., & Riedmiller, M. (2007). Neural reinforcement learning controllers for a real robot application. In Proceedings of the IEEE international conference on robotics and automation (ICRA 07), Rome, Italy. New York: IEEE Press.
- (2007) Proceedings of the IEEE International Conference on Robotics and Automation (ICRA 07) Rome, Italy
- Hafner, R.¹ Riedmiller, M.²

14
- 67650793884
- Visual robot detection in RoboCup using neural networks
- Springer Berlin
- Kaufmann, U., Mayer, G., Kraetzschmar, G., & Palm, G. (2004). Visual robot detection in RoboCup using neural networks. In D. Nardi, M. Riedmiller, C. Sammut, & J. Santos-Victor (Eds.), LNCS. RoboCup 2004: robot soccer world cup VIII (pp. 310-322), Porto, Portugal. Berlin: Springer.
- (2004) RoboCup 2004: Robot Soccer World Cup VIII Porto, Portugal LNCS , pp. 310-322
- Kaufmann, U.¹ Mayer, G.² Kraetzschmar, G.³ Palm, G.⁴ Nardi, D.⁵ Riedmiller, M.⁶ Sammut, C.⁷ Santos-Victor, J.⁸

15
- 0003979126
- Springer Berlin
- Kitano, H. (Ed.). (1997). RoboCup-97: robot soccer world cup I. Berlin: Springer.
- (1997) RoboCup-97: Robot Soccer World Cup i
- Kitano, H.¹

16
- 67650835709
- Learning perceptual coupling for motor primitives
- IEEE Press New York
- Kober, J., Mohler, B., & Peters, J. (2008). Learning perceptual coupling for motor primitives. In Proceedings of the 2008 IEEE/RSJ international conference on intelligent robots and systems (IROS 2008) (pp. 834-839), Nice, France. New York: IEEE Press.
- (2008) Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2008) Nice, France , pp. 834-839
- Kober, J.¹ Mohler, B.² Peters, J.³

17
- 4644323293
- Least-squares policy iteration
- M. Lagoudakis R. Parr 2003 Least-squares policy iteration Journal of Machine Learning Research 4 1107 1149
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.¹ Parr, R.²

18
- 37249048510
- Calculating the perfect match: An efficient and accurate approach for robot self-localization
- Springer Berlin
- Lauer, M., Lange, S., & Riedmiller, M. (2005). Calculating the perfect match: an efficient and accurate approach for robot self-localization. In A. Bredenfeld, A. Jacoff, I. Noda, & Y. Takahashi (Eds.), LNCS. RoboCup 2005: robot soccer world cup IX (pp. 142-153), Osaka, Japan. Berlin: Springer.
- (2005) RoboCup 2005: Robot Soccer World Cup IX Osaka, Japan LNCS , pp. 142-153
- Lauer, M.¹ Lange, S.² Riedmiller, M.³ Bredenfeld, A.⁴ Jacoff, A.⁵ Noda, I.⁶ Takahashi, Y.⁷

19
- 33745207959
- Motion estimation of moving objects for autonomous mobile robots
- M. Lauer S. Lange M. Riedmiller 2006 Motion estimation of moving objects for autonomous mobile robots Kunstliche Intelligenz 20 1 11 17
- (2006) Kunstliche Intelligenz , vol.20 , Issue.1 , pp. 11-17
- Lauer, M.¹ Lange, S.² Riedmiller, M.³

20
- 1442302498
- An adaptive color segmentation algorithm for Sony legged robots
- IASTED/ACTA Press New York
- Li, B., Hu, H., & Spacek, L. (2003). An adaptive color segmentation algorithm for Sony legged robots. In The 21st IASTED international multi-conference on applied informatics (AI 2003) (pp. 126-131), Innsbruck, Austria. New York: IASTED/ACTA Press.
- (2003) The 21st IASTED International Multi-conference on Applied Informatics (AI 2003) Innsbruck, Austria , pp. 126-131
- Li, B.¹ Hu, H.² Spacek, L.³

21
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L. Lin 1992 Self-improving reactive agents based on reinforcement learning, planning and teaching Machine Learning 8 3 293 321
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 293-321
- Lin, L.¹

22
- 84863371099
- Combining policy search with planning in multi-agent cooperation
- Springer Berlin
- Ma, J., & Cameron, S. (2008). Combining policy search with planning in multi-agent cooperation. In L. Iocchi, H. Matsubara, A. Weitzenfeld, & C. Zhou (Eds.), LNAI. RoboCup 2008: robot soccer world cup XII, Suzhou, China. Berlin: Springer.
- (2008) RoboCup 2008: Robot Soccer World Cup XII Suzhou, China LNAI
- Ma, J.¹ Cameron, S.² Iocchi, L.³ Matsubara, H.⁴ Weitzenfeld, A.⁵ Zhou, C.⁶

23
- 67650813507
- Performance evaluation of an evolutionary method for RoboCup soccer strategies
- Springer Berlin
- Nakashima, T., Takatani, M., Udo, M., Ishibuchi, H., & Nii, M. (2005). Performance evaluation of an evolutionary method for RoboCup soccer strategies. In A. Bredenfeld, A. Jacoff, I. Noda, & Y. Takahashi (Eds.), LNAI. RoboCup 2005: robot soccer world cup IX, Osaka, Japan. Berlin: Springer.
- (2005) RoboCup 2005: Robot Soccer World Cup IX Osaka, Japan LNAI
- Nakashima, T.¹ Takatani, M.² Udo, M.³ Ishibuchi, H.⁴ Nii, M.⁵ Bredenfeld, A.⁶ Jacoff, A.⁷ Noda, I.⁸ Takahashi, Y.⁹

24
- 33744488034
- Autonomous inverted helicopter flight via reinforcement learning
- Springer Berlin
- Ng, A., Coates, A., Diel, M., Ganapathi, V., Schulte, J., Tse, B., Berger, E., & Liang, E. (2004). Autonomous inverted helicopter flight via reinforcement learning. In Experimental robotics IX, the 9th international symposium on experimental robotics (ISER) (pp. 363-372), Singapore, China. Berlin: Springer.
- (2004) Experimental Robotics IX, the 9th International Symposium on Experimental Robotics (ISER) Singapore, China , pp. 363-372
- Ng, A.¹ Coates, A.² Diel, M.³ Ganapathi, V.⁴ Schulte, J.⁵ Tse, B.⁶ Berger, E.⁷ Liang, E.⁸

25
- 0032021222
- Soccer server: A tool for research on multiagent systems
- I. Noda H. Matsubara K. Hiraki I. Frank 1998 Soccer server: a tool for research on multi-agent systems Applied Artificial Intelligence 12 2-3 233 250 (Pubitemid 127619180)
- (1998) Applied Artificial Intelligence , vol.12 , Issue.2-3 , pp. 233-250
- Noda, I.¹ Matsubara, H.² Hiraki, K.³ Frank, I.⁴

26
- 4544333988
- Reinforcement learning of humanoid rhythmic walking parameters based on visual information
- M. Ogino Y. Katoh M. Aono M. Asada K. Hosoda 2004 Reinforcement learning of humanoid rhythmic walking parameters based on visual information Advanced Robotics 18 7 677 697
- (2004) Advanced Robotics , vol.18 , Issue.7 , pp. 677-697
- Ogino, M.¹ Katoh, Y.² Aono, M.³ Asada, M.⁴ Hosoda, K.⁵

27
- 28444471207
- Kinematic and dynamic adaptive control of a nonholonomic mobile robot using a RNN
- IEEE Press New York
- Oubbati, M., Schanz, M., & Levi, P. (2005). Kinematic and dynamic adaptive control of a nonholonomic mobile robot using a RNN. In Proceedings of the 20005 IEEE international symposium on computational intelligence in robotics and automation (CIRA 2005) (pp. 27-33). New York: IEEE Press.
- (2005) Proceedings of the 20005 IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA 2005) , pp. 27-33
- Oubbati, M.¹ Schanz, M.² Levi, P.³

28
- 34250635407
- Policy gradient methods for robotics
- IEEE Press New York
- Peters, J., & Schaal, S. (2006). Policy gradient methods for robotics. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS), Beijing, China. New York: IEEE Press.
- (2006) Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Beijing, China
- Peters, J.¹ Schaal, S.²

29
- 38649095925
- Learning to control in operational space
- DOI 10.1177/0278364907087548
- J. Peters S. Schaal 2008 Learning to control in operational space The International Journal of Robotics Research 27 2 197 212 (Pubitemid 351169714)
- (2008) International Journal of Robotics Research , vol.27 , Issue.2 , pp. 197-212
- Peters, J.¹ Schaal, S.²

30
- 44949241322
- Reinforcement learning of motor skills with policy gradients
- J. Peters S. Schaal 2008 Reinforcement learning of motor skills with policy gradients Neural Networks 21 4 682 697
- (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
- Peters, J.¹ Schaal, S.²

31
- 85102627959
- Wiley-Interscience New York
- Puterman, M. (2005). Markov decision processes: discrete stochastic dynamic programming. New York: Wiley-Interscience.
- (2005) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

32
- 0009267623
- Generating continuous control signals for reinforcement controllers using dynamic output elements
- Bruges, Belgium
- Riedmiller, M. (1997). Generating continuous control signals for reinforcement controllers using dynamic output elements. In Proceedings of the European symposium on artificial neural networks (ESANN 1997), Bruges, Belgium.
- (1997) Proceedings of the European Symposium on Artificial Neural Networks (ESANN 1997)
- Riedmiller, M.¹

33
- 33646687423
- Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
- Springer Berlin
- Riedmiller, M. (2005). Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method. In Machine learning: ECML 2005, 16th European conference on machine learning, Porto, Portugal. Berlin: Springer.
- (2005) Machine Learning: ECML 2005, 16th European Conference on Machine Learning Porto, Portugal
- Riedmiller, M.¹

34
- 84943274699
- Direct adaptive method for faster backpropagation learning: The RPROP algorithm
- Riedmiller, M., & Braun, H., (1993). A direct adaptive method for faster backpropagation learning: the RPROP algorithm. In H. Ruspini (Ed.), Proceedings of the IEEE international conference on neural networks (ICNN) (pp. 586-591), San Francisco. (Pubitemid 23662229)
- (1993) 1993 IEEE International Conference on Neural Networks , pp. 586-591
- Riedmiller Martin¹ Braun Heinrich²

35
- 0346242076
- Using machine learning techniques in complex multi-agent domains
- Springer Berlin
- Riedmiller, M., & Merke, A. (2003). Using machine learning techniques in complex multi-agent domains. In I. Stamatescu, W. Menzel, M. Richter, & U. Ratsch (Eds.), Adaptivity and learning. Berlin: Springer.
- (2003) Adaptivity and Learning
- Riedmiller, M.¹ Merke, A.² Stamatescu, I.³ Menzel, W.⁴ Richter, M.⁵ Ratsch, U.⁶

36
- 79958857418
- Learning to drive in 20 minutes
- Springer Berlin
- Riedmiller, M., Montemerlo, M., & Dahlkamp, H. (2007). Learning to drive in 20 minutes. In Proceedings of the FBIT 2007 conference, Jeju, Korea. Berlin: Springer.
- (2007) Proceedings of the FBIT 2007 Conference Jeju, Korea
- Riedmiller, M.¹ Montemerlo, M.² Dahlkamp, H.³

37
- 26444463041
- Evolutionary gait-optimization using a fitness function based on proprioception
- Springer Berlin
- Röfer, T. (2004). Evolutionary gait-optimization using a fitness function based on proprioception. In Nardi, D., Riedmiller, M., Sammut, C., & Santos-Victor, J. (Eds.), LNCS. RoboCup 2004: robot soccer world cup VIII (pp. 310-322), Porto, Portugal. Berlin: Springer.
- (2004) RoboCup 2004: Robot Soccer World Cup VIII Porto, Portugal LNCS , pp. 310-322
- Röfer, T.¹ Nardi, D.² Riedmiller, M.³ Sammut, C.⁴ Santos-Victor, J.⁵

38
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- P. Stone R. Sutton G. Kuhlmann 2005 Reinforcement learning for RoboCup-soccer keepaway Adaptive Behavior 13 3 165 188
- (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
- Stone, P.¹ Sutton, R.² Kuhlmann, G.³

39
- 0004102479
- MIT Press/A Bradford Book Cambridge
- Sutton, R., & Barto, A. (1998). Reinforcement learning. An introduction. Cambridge: MIT Press/A Bradford Book.
- (1998) Reinforcement Learning. An Introduction
- Sutton, R.¹ Barto, A.²

40
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- MIT Press Cambridge
- Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. In Advances in neural information processing systems 12 (NIPS 1999) (pp. 1057-1063), Denver, USA. Cambridge: MIT Press.
- (2000) Advances in Neural Information Processing Systems 12 (NIPS 1999) Denver, USA , pp. 1057-1063
- Sutton, R.¹ Singh, S.² Mansour, Y.³

41
- 0001332415
- On-line policy improvement using Monte Carlo search
- Springer Berlin
- Tesauro, G., & Galpering, G. (1995). On-line policy improvement using Monte Carlo search. In Neural information processing systems (NIPS 1996) (pp. 206-221), Denver, USA. Berlin: Springer.
- (1995) Neural Information Processing Systems (NIPS 1996) Denver, USA , pp. 206-221
- Tesauro, G.¹ Galpering, G.²

42
- 0024702037
- A parallel network that learns to play backgammon
- G. Tesauro T. Sejnowski 1989 A parallel network that learns to play backgammon Artificial Intelligence 39 3 357 390
- (1989) Artificial Intelligence , vol.39 , Issue.3 , pp. 357-390
- Tesauro, G.¹ Sejnowski, T.²

43
- 3342953146
- Real-time object tracking for soccer-robots without color information
- A. Treptow A. Zell 2004 Real-time object tracking for soccer-robots without color information Robotics and Autonomous Systems 48 1 41 48
- (2004) Robotics and Autonomous Systems , vol.48 , Issue.1 , pp. 41-48
- Treptow, A.¹ Zell, A.²

44
- 34249833101
- C. Watkins P. Dayan 1992 Q-learning Machine Learning 8 279 292
- (1992) Q-learning Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

45
- 84983135293
- New developments in the application of automatic learning to power system control
- Liege, Belgium
- Wehenkel, L., Glavic, M., & Ernst, D. (2005). New developments in the application of automatic learning to power system control. In Proceedings of the 15th power systems computation conference (PSCC05), Liege, Belgium.
- (2005) Proceedings of the 15th Power Systems Computation Conference (PSCC05)
- Wehenkel, L.¹ Glavic, M.² Ernst, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.