-
1
-
-
0033148990
-
Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development
-
M. Asada E. Uchibe K. Hosoda 1999 Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based reinforcement learning and development Artificial Intelligence 110 2 275 292
-
(1999)
Artificial Intelligence
, vol.110
, Issue.2
, pp. 275-292
-
-
Asada, M.1
Uchibe, E.2
Hosoda, K.3
-
3
-
-
33845878062
-
Predicting away robot control latency
-
Springer Berlin
-
Behnke, S., Egorova, A., Gloye, A., Rojas, R., & Simon, M. (2003). Predicting away robot control latency. In D. Polani, B. Browning, A. Bonarini, & K. Yoshida (Eds.), LNCS. RoboCup 2003: robot soccer world cup VII (pp. 712-719), Padua, Italy. Berlin: Springer.
-
(2003)
RoboCup 2003: Robot Soccer World Cup VII Padua, Italy LNCS
, pp. 712-719
-
-
Behnke, S.1
Egorova, A.2
Gloye, A.3
Rojas, R.4
Simon, M.5
Polani, D.6
Browning, B.7
Bonarini, A.8
Yoshida, K.9
-
4
-
-
0003787146
-
-
Princeton University Press Princeton
-
Bellman, R. (1957). Dynamic programming. Princeton: Princeton University Press.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
10
-
-
34548729592
-
Bridging the gap: Learning in the RoboCup simulation and midsize league
-
Porto, Portugal
-
Gabel, T., Hafner, R., Lange, S., Lauer, M., & Riedmiller, M. (2006). Bridging the gap: learning in the RoboCup simulation and midsize league. In Proceedings of the 7th Portuguese conference on automatic control (Controlo 2006), Porto, Portugal.
-
(2006)
Proceedings of the 7th Portuguese Conference on Automatic Control (Controlo 2006)
-
-
Gabel, T.1
Hafner, R.2
Lange, S.3
Lauer, M.4
Riedmiller, M.5
-
11
-
-
70349314519
-
A case study on improving defense behavior in soccer simulation 2D: The NeuroHassle approach
-
Springer Berlin
-
Gabel, T., Riedmiller, M., & Trost, F. (2008). A case study on improving defense behavior in soccer simulation 2D: the NeuroHassle approach. In Iocchi, L., Matsubara, H., Weitzenfeld, A., & Zhou, C. (Eds.), LNCS. RoboCup 2008: robot soccer world cup XII, Suzhou, China. Berlin: Springer.
-
(2008)
LNCS. RoboCup 2008: Robot Soccer World Cup XII Suzhou, China
-
-
Gabel, T.1
Riedmiller, M.2
Trost, F.3
Iocchi, L.4
Matsubara, H.5
Weitzenfeld, A.6
Zhou, C.7
-
12
-
-
84880694195
-
Stable function approximation in dynamic programming
-
Morgan Kaufmann San Mateo
-
Gordon, G., Prieditis, A., & Russell, S. (1995). Stable function approximation in dynamic programming. In Proceedings of the twelfth international conference on machine learning (ICML 1995) (pp. 261-268), Tahoe City, USA. San Mateo: Morgan Kaufmann.
-
(1995)
Proceedings of the Twelfth International Conference on Machine Learning (ICML 1995) Tahoe City, USA
, pp. 261-268
-
-
Gordon, G.1
Prieditis, A.2
Russell, S.3
-
14
-
-
67650793884
-
Visual robot detection in RoboCup using neural networks
-
Springer Berlin
-
Kaufmann, U., Mayer, G., Kraetzschmar, G., & Palm, G. (2004). Visual robot detection in RoboCup using neural networks. In D. Nardi, M. Riedmiller, C. Sammut, & J. Santos-Victor (Eds.), LNCS. RoboCup 2004: robot soccer world cup VIII (pp. 310-322), Porto, Portugal. Berlin: Springer.
-
(2004)
RoboCup 2004: Robot Soccer World Cup VIII Porto, Portugal LNCS
, pp. 310-322
-
-
Kaufmann, U.1
Mayer, G.2
Kraetzschmar, G.3
Palm, G.4
Nardi, D.5
Riedmiller, M.6
Sammut, C.7
Santos-Victor, J.8
-
16
-
-
67650835709
-
Learning perceptual coupling for motor primitives
-
IEEE Press New York
-
Kober, J., Mohler, B., & Peters, J. (2008). Learning perceptual coupling for motor primitives. In Proceedings of the 2008 IEEE/RSJ international conference on intelligent robots and systems (IROS 2008) (pp. 834-839), Nice, France. New York: IEEE Press.
-
(2008)
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2008) Nice, France
, pp. 834-839
-
-
Kober, J.1
Mohler, B.2
Peters, J.3
-
18
-
-
37249048510
-
Calculating the perfect match: An efficient and accurate approach for robot self-localization
-
Springer Berlin
-
Lauer, M., Lange, S., & Riedmiller, M. (2005). Calculating the perfect match: an efficient and accurate approach for robot self-localization. In A. Bredenfeld, A. Jacoff, I. Noda, & Y. Takahashi (Eds.), LNCS. RoboCup 2005: robot soccer world cup IX (pp. 142-153), Osaka, Japan. Berlin: Springer.
-
(2005)
RoboCup 2005: Robot Soccer World Cup IX Osaka, Japan LNCS
, pp. 142-153
-
-
Lauer, M.1
Lange, S.2
Riedmiller, M.3
Bredenfeld, A.4
Jacoff, A.5
Noda, I.6
Takahashi, Y.7
-
19
-
-
33745207959
-
Motion estimation of moving objects for autonomous mobile robots
-
M. Lauer S. Lange M. Riedmiller 2006 Motion estimation of moving objects for autonomous mobile robots Kunstliche Intelligenz 20 1 11 17
-
(2006)
Kunstliche Intelligenz
, vol.20
, Issue.1
, pp. 11-17
-
-
Lauer, M.1
Lange, S.2
Riedmiller, M.3
-
20
-
-
1442302498
-
An adaptive color segmentation algorithm for Sony legged robots
-
IASTED/ACTA Press New York
-
Li, B., Hu, H., & Spacek, L. (2003). An adaptive color segmentation algorithm for Sony legged robots. In The 21st IASTED international multi-conference on applied informatics (AI 2003) (pp. 126-131), Innsbruck, Austria. New York: IASTED/ACTA Press.
-
(2003)
The 21st IASTED International Multi-conference on Applied Informatics (AI 2003) Innsbruck, Austria
, pp. 126-131
-
-
Li, B.1
Hu, H.2
Spacek, L.3
-
21
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L. Lin 1992 Self-improving reactive agents based on reinforcement learning, planning and teaching Machine Learning 8 3 293 321
-
(1992)
Machine Learning
, vol.8
, Issue.3
, pp. 293-321
-
-
Lin, L.1
-
22
-
-
84863371099
-
Combining policy search with planning in multi-agent cooperation
-
Springer Berlin
-
Ma, J., & Cameron, S. (2008). Combining policy search with planning in multi-agent cooperation. In L. Iocchi, H. Matsubara, A. Weitzenfeld, & C. Zhou (Eds.), LNAI. RoboCup 2008: robot soccer world cup XII, Suzhou, China. Berlin: Springer.
-
(2008)
RoboCup 2008: Robot Soccer World Cup XII Suzhou, China LNAI
-
-
Ma, J.1
Cameron, S.2
Iocchi, L.3
Matsubara, H.4
Weitzenfeld, A.5
Zhou, C.6
-
23
-
-
67650813507
-
Performance evaluation of an evolutionary method for RoboCup soccer strategies
-
Springer Berlin
-
Nakashima, T., Takatani, M., Udo, M., Ishibuchi, H., & Nii, M. (2005). Performance evaluation of an evolutionary method for RoboCup soccer strategies. In A. Bredenfeld, A. Jacoff, I. Noda, & Y. Takahashi (Eds.), LNAI. RoboCup 2005: robot soccer world cup IX, Osaka, Japan. Berlin: Springer.
-
(2005)
RoboCup 2005: Robot Soccer World Cup IX Osaka, Japan LNAI
-
-
Nakashima, T.1
Takatani, M.2
Udo, M.3
Ishibuchi, H.4
Nii, M.5
Bredenfeld, A.6
Jacoff, A.7
Noda, I.8
Takahashi, Y.9
-
24
-
-
33744488034
-
Autonomous inverted helicopter flight via reinforcement learning
-
Springer Berlin
-
Ng, A., Coates, A., Diel, M., Ganapathi, V., Schulte, J., Tse, B., Berger, E., & Liang, E. (2004). Autonomous inverted helicopter flight via reinforcement learning. In Experimental robotics IX, the 9th international symposium on experimental robotics (ISER) (pp. 363-372), Singapore, China. Berlin: Springer.
-
(2004)
Experimental Robotics IX, the 9th International Symposium on Experimental Robotics (ISER) Singapore, China
, pp. 363-372
-
-
Ng, A.1
Coates, A.2
Diel, M.3
Ganapathi, V.4
Schulte, J.5
Tse, B.6
Berger, E.7
Liang, E.8
-
26
-
-
4544333988
-
Reinforcement learning of humanoid rhythmic walking parameters based on visual information
-
M. Ogino Y. Katoh M. Aono M. Asada K. Hosoda 2004 Reinforcement learning of humanoid rhythmic walking parameters based on visual information Advanced Robotics 18 7 677 697
-
(2004)
Advanced Robotics
, vol.18
, Issue.7
, pp. 677-697
-
-
Ogino, M.1
Katoh, Y.2
Aono, M.3
Asada, M.4
Hosoda, K.5
-
29
-
-
38649095925
-
Learning to control in operational space
-
DOI 10.1177/0278364907087548
-
J. Peters S. Schaal 2008 Learning to control in operational space The International Journal of Robotics Research 27 2 197 212 (Pubitemid 351169714)
-
(2008)
International Journal of Robotics Research
, vol.27
, Issue.2
, pp. 197-212
-
-
Peters, J.1
Schaal, S.2
-
30
-
-
44949241322
-
Reinforcement learning of motor skills with policy gradients
-
J. Peters S. Schaal 2008 Reinforcement learning of motor skills with policy gradients Neural Networks 21 4 682 697
-
(2008)
Neural Networks
, vol.21
, Issue.4
, pp. 682-697
-
-
Peters, J.1
Schaal, S.2
-
34
-
-
84943274699
-
Direct adaptive method for faster backpropagation learning: The RPROP algorithm
-
Riedmiller, M., & Braun, H., (1993). A direct adaptive method for faster backpropagation learning: the RPROP algorithm. In H. Ruspini (Ed.), Proceedings of the IEEE international conference on neural networks (ICNN) (pp. 586-591), San Francisco. (Pubitemid 23662229)
-
(1993)
1993 IEEE International Conference on Neural Networks
, pp. 586-591
-
-
Riedmiller Martin1
Braun Heinrich2
-
35
-
-
0346242076
-
Using machine learning techniques in complex multi-agent domains
-
Springer Berlin
-
Riedmiller, M., & Merke, A. (2003). Using machine learning techniques in complex multi-agent domains. In I. Stamatescu, W. Menzel, M. Richter, & U. Ratsch (Eds.), Adaptivity and learning. Berlin: Springer.
-
(2003)
Adaptivity and Learning
-
-
Riedmiller, M.1
Merke, A.2
Stamatescu, I.3
Menzel, W.4
Richter, M.5
Ratsch, U.6
-
36
-
-
79958857418
-
Learning to drive in 20 minutes
-
Springer Berlin
-
Riedmiller, M., Montemerlo, M., & Dahlkamp, H. (2007). Learning to drive in 20 minutes. In Proceedings of the FBIT 2007 conference, Jeju, Korea. Berlin: Springer.
-
(2007)
Proceedings of the FBIT 2007 Conference Jeju, Korea
-
-
Riedmiller, M.1
Montemerlo, M.2
Dahlkamp, H.3
-
37
-
-
26444463041
-
Evolutionary gait-optimization using a fitness function based on proprioception
-
Springer Berlin
-
Röfer, T. (2004). Evolutionary gait-optimization using a fitness function based on proprioception. In Nardi, D., Riedmiller, M., Sammut, C., & Santos-Victor, J. (Eds.), LNCS. RoboCup 2004: robot soccer world cup VIII (pp. 310-322), Porto, Portugal. Berlin: Springer.
-
(2004)
RoboCup 2004: Robot Soccer World Cup VIII Porto, Portugal LNCS
, pp. 310-322
-
-
Röfer, T.1
Nardi, D.2
Riedmiller, M.3
Sammut, C.4
Santos-Victor, J.5
-
38
-
-
27544506565
-
Reinforcement learning for RoboCup-soccer keepaway
-
P. Stone R. Sutton G. Kuhlmann 2005 Reinforcement learning for RoboCup-soccer keepaway Adaptive Behavior 13 3 165 188
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.2
Kuhlmann, G.3
-
40
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
MIT Press Cambridge
-
Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. In Advances in neural information processing systems 12 (NIPS 1999) (pp. 1057-1063), Denver, USA. Cambridge: MIT Press.
-
(2000)
Advances in Neural Information Processing Systems 12 (NIPS 1999) Denver, USA
, pp. 1057-1063
-
-
Sutton, R.1
Singh, S.2
Mansour, Y.3
-
42
-
-
0024702037
-
A parallel network that learns to play backgammon
-
G. Tesauro T. Sejnowski 1989 A parallel network that learns to play backgammon Artificial Intelligence 39 3 357 390
-
(1989)
Artificial Intelligence
, vol.39
, Issue.3
, pp. 357-390
-
-
Tesauro, G.1
Sejnowski, T.2
-
43
-
-
3342953146
-
Real-time object tracking for soccer-robots without color information
-
A. Treptow A. Zell 2004 Real-time object tracking for soccer-robots without color information Robotics and Autonomous Systems 48 1 41 48
-
(2004)
Robotics and Autonomous Systems
, vol.48
, Issue.1
, pp. 41-48
-
-
Treptow, A.1
Zell, A.2
-
45
-
-
84983135293
-
New developments in the application of automatic learning to power system control
-
Liege, Belgium
-
Wehenkel, L., Glavic, M., & Ernst, D. (2005). New developments in the application of automatic learning to power system control. In Proceedings of the 15th power systems computation conference (PSCC05), Liege, Belgium.
-
(2005)
Proceedings of the 15th Power Systems Computation Conference (PSCC05)
-
-
Wehenkel, L.1
Glavic, M.2
Ernst, D.3
|