-
2
-
-
14344251217
-
Apprenticeship learning via inverse reinforcement learning
-
Abbeel, P., Ng, A.: Apprenticeship learning via inverse reinforcement learning. In: Proc. 21st Int. Conf. Machine Learning, pp. 1-8 (2004)
-
(2004)
Proc. 21st Int. Conf. Machine Learning
, pp. 1-8
-
-
Abbeel, P.1
Ng, A.2
-
3
-
-
63149159130
-
A survey of robot learning from demonstration
-
Argall, B., Chernova, S., Veloso, M.: A survey of robot learning from demonstration. Robotics and Autonomous Systems 57(5), 469-483 (2009)
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argall, B.1
Chernova, S.2
Veloso, M.3
-
4
-
-
65349173646
-
Interactive policy learning through confidence-based autonomy
-
Chernova, S., Veloso, M.: Interactive policy learning through confidence-based autonomy. J. Artificial Intelligence Research 34, 1-25 (2009)
-
(2009)
J. Artificial Intelligence Research
, vol.34
, pp. 1-25
-
-
Chernova, S.1
Veloso, M.2
-
6
-
-
33744466799
-
Approximate policy iteration with a policy language bias: Solving relational Markov decision processes
-
Fern, A., Yoon, S., Givan, R.: Approximate policy iteration with a policy language bias: Solving relational Markov decision processes. J. Artificial Intelligence Research 25, 75-118 (2006)
-
(2006)
J. Artificial Intelligence Research
, vol.25
, pp. 75-118
-
-
Fern, A.1
Yoon, S.2
Givan, R.3
-
7
-
-
38149108415
-
Metrics for finite Markov decision processes
-
Ferns, N., Panangaden, P., Precup, D.: Metrics for finite Markov decision processes. In: Proc. 20th Conf. Uncertainty in Artificial Intelligence, pp. 162-169 (2004)
-
(2004)
Proc. 20th Conf. Uncertainty in Artificial Intelligence
, pp. 162-169
-
-
Ferns, N.1
Panangaden, P.2
Precup, D.3
-
8
-
-
47249139892
-
Metrics for Markov decision processes with infinite state-spaces
-
Ferns, N., Panangaden, P., Precup, D.: Metrics for Markov decision processes with infinite state-spaces. In: Proc. 21st Conf. Uncertainty in Artificial Intelligence, pp. 201-208 (2005)
-
(2005)
Proc. 21st Conf. Uncertainty in Artificial Intelligence
, pp. 201-208
-
-
Ferns, N.1
Panangaden, P.2
Precup, D.3
-
9
-
-
0038517214
-
Equivalence notions and model minimization in Markov Decision Processes
-
Givan, R., Dean, T., Greig, M.: Equivalence notions and model minimization in Markov Decision Processes. Artificial Intelligence 147, 163-223 (2003)
-
(2003)
Artificial Intelligence
, vol.147
, pp. 163-223
-
-
Givan, R.1
Dean, T.2
Greig, M.3
-
10
-
-
1942420814
-
Reinforcement learning as classification: Leveraging modern classifiers
-
Lagoudakis, M., Parr, R.: Reinforcement learning as classification: Leveraging modern classifiers. In: Proc. 20th Int. Conf. Machine Learning, pp. D424-D431 (2003)
-
(2003)
Proc. 20th Int. Conf. Machine Learning
-
-
Lagoudakis, M.1
Parr, R.2
-
11
-
-
31844448029
-
Relating reinforcement learning performance to classification performance
-
Langford, J., Zadrozny, B.: Relating reinforcement learning performance to classification performance. In: Proc. 22nd Int. Conf. Machine Learning, pp. D473-D480 (2005)
-
(2005)
Proc. 22nd Int. Conf. Machine Learning
-
-
Langford, J.1
Zadrozny, B.2
-
13
-
-
74049086730
-
Abstraction levels for robotic imitation: Overview and computational approaches
-
Lopes, M., Melo, F., Montesano, L., Santos-Victor, J.: Abstraction levels for robotic imitation: Overview and computational approaches. In: From Motor Learning to Interaction Learning in Robots, pp. 313-355 (2010)
-
(2010)
From Motor Learning to Interaction Learning in Robots
, pp. 313-355
-
-
Lopes, M.1
Melo, F.2
Montesano, L.3
Santos-Victor, J.4
-
16
-
-
72449199041
-
Training parsers by inverse reinforcement learning
-
accepted
-
Neu, G., Szepesvári, C.: Training parsers by inverse reinforcement learning. Machine Learning (2009) (accepted)
-
(2009)
Machine Learning
-
-
Neu, G.1
Szepesvári, C.2
-
18
-
-
0003212629
-
Efficient training of artificial neural networks for autonomous navigation
-
Pomerleau, D.: Efficient training of artificial neural networks for autonomous navigation. Neural Computation 3(1), 88-97 (1991)
-
(1991)
Neural. Computation
, vol.3
, Issue.1
, pp. 88-97
-
-
Pomerleau, D.1
-
20
-
-
33749252753
-
Maximum margin planning
-
Ratliff, N., Bagnell, J., Zinkevich, M.: Maximum margin planning. In: Proc. 23rd Int. Conf. Machine Learning, pp. 729-736 (2006)
-
(2006)
Proc. 23rd Int. Conf. Machine Learning
, pp. 729-736
-
-
Ratliff, N.1
Bagnell, J.2
Zinkevich, M.3
-
23
-
-
0003408420
-
-
MIT Press, Cambridge
-
Schölkopf, B., Smola, A.: Learning with kernels: Support vector machines, regularization, optimization and beyond. MIT Press, Cambridge (2002)
-
(2002)
Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond
-
-
Schölkopf, B.1
Smola, A.2
-
24
-
-
68949137209
-
Active learning literature survey
-
Univ. Wisconsin-Maddison
-
Settles, B.: Active learning literature survey. Tech. Rep. CS Tech. Rep. 1648, Univ. Wisconsin-Maddison (2009)
-
(2009)
Tech. Rep. CS Tech. Rep.
, vol.1648
-
-
Settles, B.1
-
25
-
-
85162012324
-
A game-theoretic approach to apprenticeship learning
-
Syed, U., Schapire, R.: A game-theoretic approach to apprenticeship learning. In: Adv. Neural Information Proc. Systems, vol. 20, pp. 1449-1456 (2008)
-
(2008)
Adv. Neural. Information Proc. Systems
, vol.20
, pp. 1449-1456
-
-
Syed, U.1
Schapire, R.2
-
26
-
-
56449119102
-
Apprenticeship learning using linear programming
-
Syed, U., Schapire, R., Bowling, M.: Apprenticeship learning using linear programming. In: Proc. 25th Int. Conf. Machine Learning, pp. 1032-1039 (2008)
-
(2008)
Proc. 25th Int. Conf. Machine Learning
, pp. 1032-1039
-
-
Syed, U.1
Schapire, R.2
Bowling, M.3
-
27
-
-
78049390608
-
Bounding performance loss in approximate MDP homomorphisms
-
Taylor, J., Precup, D., Panangaden, P.: Bounding performance loss in approximate MDP homomorphisms. In: Adv. Neural Information Proc. Systems, pp. 1649-1656 (2008)
-
(2008)
Adv. Neural. Information Proc. Systems
, pp. 1649-1656
-
-
Taylor, J.1
Precup, D.2
Panangaden, P.3
-
28
-
-
84898974832
-
Kernel logistic regression and the import vector machine
-
Zhu, J., Hastie, T.: Kernel logistic regression and the import vector machine. In: Adv. Neural Information Proc. Systems. pp. 1081-1088 (2002)
-
(2002)
Adv. Neural. Information Proc. Systems
, pp. 1081-1088
-
-
Zhu, J.1
Hastie, T.2
-
29
-
-
57749097473
-
Maximum entropy inverse reinforcement learning
-
Ziebart, B., Maas, A., Bagnell, J., Dey, A.: Maximum entropy inverse reinforcement learning. In: Proc. 23rd AAAI Conf. Artificial Intelligence, pp. 1433-1438 (2008)
-
(2008)
Proc. 23rd AAAI Conf. Artificial Intelligence
, pp. 1433-1438
-
-
Ziebart, B.1
Maas, A.2
Bagnell, J.3
Dey, A.4
|