-
3
-
-
63149159130
-
A survey of robot learning from demonstration
-
May
-
Brenna D. Argall, Sonia Chernova, Manuela Veloso, and Brett Browning. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5):469-483, May 2009.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argall, B.D.1
Chernova, S.2
Veloso, M.3
Browning, B.4
-
5
-
-
70049096468
-
Regularized policy iteration
-
A.M. Farahmand, M. Ghavamzadeh, C. Szepesvari, and S. Mannor. Regularized policy iteration. Advances in Neural Information Processing Systems, 21:441-448, 2009.
-
(2009)
Advances in Neural Information Processing Systems
, vol.21
, pp. 441-448
-
-
Farahmand, A.M.1
Ghavamzadeh, M.2
Szepesvari, C.3
Mannor, S.4
-
8
-
-
33749263205
-
Automatic basis function construction for approximate dynamic programming and reinforcement learning
-
ACM
-
P.W. Keller, S. Mannor, and D. Precup. Automatic basis function construction for approximate dynamic programming and reinforcement learning. In International Conference on Machine learning, pages 449-456. ACM, 2006.
-
(2006)
International Conference on Machine Learning
, pp. 449-456
-
-
Keller, P.W.1
Mannor, S.2
Precup, D.3
-
9
-
-
71149121683
-
Regularization and feature selection in least-squares temporal difference learning
-
ACM
-
J.Z. Kolter and A.Y. Ng. Regularization and feature selection in least-squares temporal difference learning. In International Conference on Machine Learning, pages 521-528. ACM, 2009.
-
(2009)
International Conference on Machine Learning
, pp. 521-528
-
-
Kolter, J.Z.1
Ng, A.Y.2
-
11
-
-
84881074926
-
Automatic induction of maxq hierarchies
-
S. Ray N. Mehta, M. Wynkoop, P. Tadepalli, and T. Dietterich. Automatic induction of maxq hierarchies. In Proceedings of the Hierarchical Organization of Behavior Workshop. 21st Conference on Neural Information Processing Systems, 2007.
-
Proceedings of the Hierarchical Organization of Behavior Workshop. 21st Conference on Neural Information Processing Systems, 2007
-
-
Ray, S.1
Mehta, N.2
Wynkoop, M.3
Tadepalli, P.4
Dietterich, T.5
-
12
-
-
84898980684
-
Autonomous helicopter flight via reinforcement learning
-
A.Y. Ng, H.J. Kim, M.I. Jordan, S. Sastry, and S. Ballianda. Autonomous helicopter flight via reinforcement learning. Advances in Neural Information Processing Systems, 16, 2004.
-
(2004)
Advances in Neural Information Processing Systems
, pp. 16
-
-
Ng, A.Y.1
Kim, H.J.2
Jordan, M.I.3
Sastry, S.4
Ballianda, S.5
-
13
-
-
34547982545
-
Analyzing feature generation for value-function approximation
-
ACM
-
R. Parr, C. Painter-Wakefield, L. Li, and M. Littman. Analyzing feature generation for value-function approximation. In International Conference on Machine learning, pages 737-744. ACM, 2007.
-
(2007)
International Conference on Machine Learning
, pp. 737-744
-
-
Parr, R.1
Painter-Wakefield, C.2
Li, L.3
Littman, M.4
-
16
-
-
71149102986
-
Discovering options from example trajectories
-
ACM New York, NY, USA
-
P. Zang, P. Zhou, D. Minnen, and C.L. Isbell. Discovering options from example trajectories. In Proceedings of the 26th Annual International Conference on Machine Learning. ACM New York, NY, USA, 2009.
-
(2009)
Proceedings of the 26th Annual International Conference on Machine Learning
-
-
Zang, P.1
Zhou, P.2
Minnen, D.3
Isbell, C.L.4
-
17
-
-
33947681316
-
MLKNN: A lazy learning approach to multi-label learning
-
M.L. Zhang and Z.H. Zhou. MLKNN: A lazy learning approach to multi-label learning. Pattern Recognition, 40(7):2038-2048, 2007.
-
(2007)
Pattern Recognition
, vol.40
, Issue.7
, pp. 2038-2048
-
-
Zhang, M.L.1
Zhou, Z.H.2
|