-
2
-
-
0006221144
-
Vision-based behavior acquisition for a shooting robot by using a reinforcement learning
-
M. Asada, S. Noda, S. Tawaratsumida, K. Hosoda, Vision-based behavior acquisition for a shooting robot by using a reinforcement learning, in: Proceedings of IAPR/IEEE Workshop on Visual Behaviors-1994, pp. 112-118, 1994.
-
(1994)
Proceedings of IAPR/IEEE Workshop on Visual Behaviors-1994
, pp. 112-118
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
3
-
-
0030647149
-
Reinforcement learning in the multi-robot domain
-
M.J. Mataric Reinforcement learning in the multi-robot domain Autonomous Robots 4 1997 73 83
-
(1997)
Autonomous Robots
, vol.4
, pp. 73-83
-
-
Mataric, M.J.1
-
5
-
-
84878320217
-
Modular reinforcement learning: An application to a real robot task
-
A. Birk, J. Demiris, LNCS Springer Berlin Heidelberg
-
Z. Kalmar, C. Szepesvari, and A. Lorincz Modular reinforcement learning: an application to a real robot task A. Birk, J. Demiris, Learning Robots LNCS vol. 1545 1998 Springer Berlin Heidelberg 29 45
-
(1998)
Learning Robots
, vol.1545
, pp. 29-45
-
-
Kalmar, Z.1
Szepesvari, C.2
Lorincz, A.3
-
6
-
-
77954595557
-
On the potential contributions of hybrid intelligent approaches to multicomponent robotic system development
-
R.J. Duro, M. Graña, and J. de Lope On the potential contributions of hybrid intelligent approaches to multicomponent robotic system development Information Sciences 180 14 2010 2635 2648
-
(2010)
Information Sciences
, vol.180
, Issue.14
, pp. 2635-2648
-
-
Duro, R.J.1
Graña, M.2
De Lope, J.3
-
7
-
-
77954575291
-
Linked multicomponent robotic systems: Basic assessment of linking element dynamical effect
-
E. Corchado, M. Graña, A. Savio, Springer Verlag
-
B. Fernandez-Gauna, J.M. Lopez-Guede, and E. Zulueta Linked multicomponent robotic systems: basic assessment of linking element dynamical effect E. Corchado, M. Graña, A. Savio, Hybrid Artificial Intelligence Systems, Part I, Vol. 6076 2010 Springer Verlag 73 79
-
(2010)
Hybrid Artificial Intelligence Systems, Part I, Vol. 6076
, pp. 73-79
-
-
Fernandez-Gauna, B.1
Lopez-Guede, J.M.2
Zulueta, E.3
-
8
-
-
79952160640
-
Learning hose transport control with Q-learning
-
B. Fernandez-Gauna, J.M. Lopez-Guede, E. Zulueta, and M. Graña Learning hose transport control with Q-learning Neural Network World 20 7 2010 913 923
-
(2010)
Neural Network World
, vol.20
, Issue.7
, pp. 913-923
-
-
Fernandez-Gauna, B.1
Lopez-Guede, J.M.2
Zulueta, E.3
Graña, M.4
-
10
-
-
68949157375
-
Transfer learning for reinforcement learning domains: A survey
-
M.E. Taylor, and P. Stone Transfer learning for reinforcement learning domains: a survey Journal of Machine Learning Research 10 1 2009 1633 1685
-
(2009)
Journal of Machine Learning Research
, vol.10
, Issue.1
, pp. 1633-1685
-
-
Taylor, M.E.1
Stone, P.2
-
11
-
-
77954599219
-
Linked multi-component mobile robots: Modeling, simulation and control
-
Z. Echegoyen, I. Villaverde, R. Moreno, M. Graña, and A. d'Anjou Linked multi-component mobile robots: modeling, simulation and control Robotics and Autonomous Systems 58 12 2010 1292 1305
-
(2010)
Robotics and Autonomous Systems
, vol.58
, Issue.12
, pp. 1292-1305
-
-
Echegoyen, Z.1
Villaverde, I.2
Moreno, R.3
Graña, M.4
D'Anjou, A.5
-
12
-
-
79951649734
-
-
Los Alamitos, CA. USA
-
H. Qin, D. Terzopoulos, D-nurbs: a physics-based framework for geometric design, technical report, Los Alamitos, CA. USA, 1996.
-
(1996)
D-nurbs: A Physics-based Framework for Geometric Design, Technical Report
-
-
Qin, H.1
Terzopoulos, D.2
-
13
-
-
38649101898
-
Geometrically exact dynamic splines
-
A. Theetten, L. Grisoni, C. Andriot, and B. Barsky Geometrically exact dynamic splines Computer-Aided Design 40 1 2008 35 48
-
(2008)
Computer-Aided Design
, vol.40
, Issue.1
, pp. 35-48
-
-
Theetten, A.1
Grisoni, L.2
Andriot, C.3
Barsky, B.4
-
16
-
-
34249833101
-
-
C. Watkins, P. Dayan, Technical note: Q-learning, in: Machine Learning, vol. 8, pp. 279-292, 1992.
-
(1992)
Technical Note: Q-learning, In: Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
17
-
-
84942867726
-
An overview of maxq hierarchical reinforcement learning
-
Berthe Choueiry, Toby Walsh, Lecture Notes in Computer Science Springer Berlin Heidelberg
-
T. Dietterich An overview of maxq hierarchical reinforcement learning Berthe Choueiry, Toby Walsh, Abstraction, Reformulation, and Approximation Lecture Notes in Computer Science vol. 1864 2000 Springer Berlin Heidelberg 26 44
-
(2000)
Abstraction, Reformulation, and Approximation
, vol.1864
, pp. 26-44
-
-
Dietterich, T.1
-
18
-
-
22944471767
-
Model approximation for hexq hierarchical reinforcement learning
-
B. Hengst, Model approximation for hexq hierarchical reinforcement learning, in: ECML 2004, pp. 144-155, 2004.
-
(2004)
ECML 2004
, pp. 144-155
-
-
Hengst, B.1
-
19
-
-
38149025031
-
Multi-robot cooperation based on hierarchical reinforcement learning
-
X. Cheng, J. Shen, H. Liu, and G. Gu Multi-robot cooperation based on hierarchical reinforcement learning Lecture Notes in Computer Science 4489 2007 90 97
-
(2007)
Lecture Notes in Computer Science
, vol.4489
, pp. 90-97
-
-
Cheng, X.1
Shen, J.2
Liu, H.3
Gu, G.4
-
20
-
-
31144477417
-
Risk-sensitive reinforcement learning applied to control under constraints
-
P. Geibel, and F. Wysotzki Risk-sensitive reinforcement learning applied to control under constraints Journal of Artificial Intelligence Research 24 2005 81 108
-
(2005)
Journal of Artificial Intelligence Research
, vol.24
, pp. 81-108
-
-
Geibel, P.1
Wysotzki, F.2
-
21
-
-
33750372439
-
Reinforcement learning for MDPs with constraints
-
Johannes Fürnkranz, Tobias Scheffer, Myra Spiliopoulou, Lecture Notes in Computer Science Springer
-
P. Geibel Reinforcement learning for MDPs with constraints Johannes Fürnkranz, Tobias Scheffer, Myra Spiliopoulou, ECML Lecture Notes in Computer Science vol. 4212 2006 Springer 646 653
-
(2006)
ECML
, vol.4212
, pp. 646-653
-
-
Geibel, P.1
-
24
-
-
31844444663
-
Exploration and apprenticeship learning in reinforcement learning
-
P. Abbeel, A.Y. Ng, Exploration and apprenticeship learning in reinforcement learning, in: Proceedings of 21st International Conference on Machine Learning, ICML, pp. 1-8, 2005.
-
(2005)
Proceedings of 21st International Conference on Machine Learning, ICML
, pp. 1-8
-
-
Abbeel, P.1
Ng, A.Y.2
-
25
-
-
79956136559
-
Safe exploration for reinforcement learning
-
A. Hans, D. Schneegaß, A.M. Schäfer, S. Udluft, Safe exploration for reinforcement learning, in: ESANN, pp. 143-148, 2008.
-
(2008)
ESANN
, pp. 143-148
-
-
Hans, A.1
Schneegaß, D.2
Schäfer, A.M.3
Udluft, S.4
-
26
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
R. Sutton, D. Precup, and S. Singh Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning Artificial Intelligence 112 1999 181 211
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.1
Precup, D.2
Singh, S.3
-
27
-
-
0344752303
-
Training and tracking in robotics
-
Morgan Kaufmann Publishers Inc. San Francisco, CA, USA
-
O.G. Selfridge, R.S. Sutton, and A.G. Barto Training and tracking in robotics Proceedings of the 9th International Joint Conference on Artificial Intelligence - Volume 1 1985 Morgan Kaufmann Publishers Inc. San Francisco, CA, USA 670 672
-
(1985)
Proceedings of the 9th International Joint Conference on Artificial Intelligence - Volume 1
, pp. 670-672
-
-
Selfridge, O.G.1
Sutton, R.S.2
Barto, A.G.3
-
28
-
-
56049125072
-
Transfer of samples in batch reinforcement learning
-
A. Lazaric, M. Restelli, A. Bonarini A., Transfer of samples in batch reinforcement learning, in: Proceedings of the 25th Annual ICML, pp. 544-551, 2008.
-
(2008)
Proceedings of the 25th Annual ICML
, pp. 544-551
-
-
Lazaric, A.1
Restelli, M.2
Bonarini A, A.3
-
29
-
-
58349096666
-
Proto-transfer learning in Markov decision processes using spectral methods
-
K. Ferguson, S. Mahadevan, Proto-transfer learning in Markov decision processes using spectral methods, in: ICML Workshop on Transfer Learning, 2006.
-
(2006)
ICML Workshop on Transfer Learning
-
-
Ferguson, K.1
Mahadevan, S.2
-
33
-
-
84880803349
-
Generalizing plans to new environments in relational MDPs
-
C. Guestrin, D. Koller, C. Gearhart, N. Kanodia, Generalizing plans to new environments in relational MDPs, in: International Joint Conference on Artificial Intelligence, IJCAI-03, pp. 1003-1010, 2003.
-
(2003)
International Joint Conference on Artificial Intelligence, IJCAI-03
, pp. 1003-1010
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
|