-
2
-
-
33747195910
-
Machine learning for fast quadrupedal locomotion
-
N. Kohl and P. Stone, "Machine learning for fast quadrupedal locomotion," in AAAI, 2004.
-
(2004)
AAAI
-
-
Kohl, N.1
Stone, P.2
-
3
-
-
3042583887
-
Autonomous helicopter flight via reinforcement learning
-
A. Ng, H. J. Kim, M. Jordan, and S. Sastry, "Autonomous helicopter flight via reinforcement learning," in NIPS 16, 2003.
-
(2003)
NIPS
, vol.16
-
-
Ng, A.1
Kim, H.J.2
Jordan, M.3
Sastry, S.4
-
4
-
-
84880854156
-
R-Max - A general polynomial time algorithm for near-optimal reinforcement learning
-
R. Brafman and M. Tennenholtz, "R-Max - a general polynomial time algorithm for near-optimal reinforcement learning," in IJCAI, 2001.
-
(2001)
IJCAI
-
-
Brafman, R.1
Tennenholtz, M.2
-
5
-
-
80053441894
-
PILCO: A model-based and dataefficient approach to policy search
-
June
-
M. Deisenroth and C. Rasmussen, "PILCO: A model-based and dataefficient approach to policy search," in ICML, June 2011.
-
(2011)
ICML
-
-
Deisenroth, M.1
Rasmussen, C.2
-
6
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
R. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," in ICML, 1990.
-
(1990)
ICML
-
-
Sutton, R.1
-
7
-
-
56449110907
-
Sample-based learning and search with permanent and transient memories
-
D. Silver, R. Sutton, and M. Müller, "Sample-based learning and search with permanent and transient memories," in ICML, 2008.
-
(2008)
ICML
-
-
Silver, D.1
Sutton, R.2
Müller, M.3
-
8
-
-
85167397400
-
Integrating sample-based planning and model-based reinforcement learning
-
T. Walsh, S. Goschin, and M. Littman, "Integrating sample-based planning and model-based reinforcement learning," in AAAI, 2010.
-
(2010)
AAAI
-
-
Walsh, T.1
Goschin, S.2
Littman, M.3
-
9
-
-
34547975806
-
Bandit based Monte-Carlo planning
-
L. Kocsis and C. Szepesvári, "Bandit based Monte-Carlo planning," in ECML, 2006.
-
(2006)
ECML
-
-
Kocsis, L.1
Szepesvári, C.2
-
10
-
-
78149247074
-
Real time targeted exploration in large domains
-
August
-
T. Hester and P. Stone, "Real time targeted exploration in large domains," in ICDL, August 2010.
-
(2010)
ICDL
-
-
Hester, T.1
Stone, P.2
-
12
-
-
84973495235
-
Multiagent interactions in urban driving
-
March
-
P. Beeson, et al., "Multiagent interactions in urban driving," Journal of Physical Agents, vol. 2, no. 1, pp. 15-30, March 2008.
-
(2008)
Journal of Physical Agents
, vol.2
, Issue.1
, pp. 15-30
-
-
Beeson, P.1
-
14
-
-
70449370276
-
RL-Glue: Language-independent software for reinforcement-learning experiments
-
Sep.
-
B. Tanner and A. White, "RL-Glue : Language-independent software for reinforcement-learning experiments," JMLR, vol. 10, Sep. 2009.
-
(2009)
JMLR
, vol.10
-
-
Tanner, B.1
White, A.2
-
15
-
-
0003673017
-
-
Ph.D. dissertation, Pittsburgh, PA, USA
-
L.-J. Lin, "Reinforcement learning for robots using neural networks," Ph.D. dissertation, Pittsburgh, PA, USA, 1992.
-
(1992)
Reinforcement Learning for Robots Using Neural Networks
-
-
Lin, L.-J.1
-
17
-
-
84899464022
-
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
-
R. Sutton, et al., "Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction," in AAMAS, 2011.
-
(2011)
AAMAS
-
-
Sutton, R.1
|