-
1
-
-
14344251217
-
Apprenticeship Learning via Inverse Reinforcement Learning
-
Brodley, C.E. (ed.) ACM, New York
-
Abbeel, P., Ng, A.Y.: Apprenticeship Learning via Inverse Reinforcement Learning. In: Brodley, C.E. (ed.) Proc. 21st Intl. Conf. on Machine Learning (ICML 2004). ACM Intl. Conf. Proc. Series, vol. 69, p. 1. ACM, New York (2004)
-
(2004)
Proc. 21st Intl. Conf. on Machine Learning (ICML 2004). ACM Intl. Conf. Proc. Series
, vol.69
, pp. 1
-
-
Abbeel, P.1
Ng, A.Y.2
-
2
-
-
16344383127
-
Convergence Results for the (1,λ)-SA-ES using the Theory of φ- Irreducible Markov Chains
-
Auger, A.: Convergence Results for the (1,λ)-SA-ES using the Theory of φ- irreducible Markov Chains. Theoretical Computer Science 334(1-3), 35-69 (2005)
-
(2005)
Theoretical Computer Science
, vol.334
, Issue.1-3
, pp. 35-69
-
-
Auger, A.1
-
3
-
-
1942450674
-
A Framework for Behavioural Cloning
-
Furukawa, K., Michie, D., Muggleton, S. (eds.) Oxford University Press, Oxford
-
Bain, M., Sammut, C.: A Framework for Behavioural Cloning. In: Furukawa, K., Michie, D., Muggleton, S. (eds.) Machine Intelligence, vol. 15, pp. 103-129. Oxford University Press, Oxford (1995)
-
(1995)
Machine Intelligence
, vol.15
, pp. 103-129
-
-
Bain, M.1
Sammut, C.2
-
4
-
-
80052420582
-
-
MIT Press, Cambridge
-
Bakir, G., Hofmann, T., Scholkopf, B., Smola, A.J., Taskar, B., Vishwanathan, S.V.N.: Machine Learning with Structured Outputs. MIT Press, Cambridge (2006)
-
(2006)
Machine Learning with Structured Outputs
-
-
Bakir, G.1
Hofmann, T.2
Scholkopf, B.3
Smola, A.J.4
Taskar, B.5
Vishwanathan, S.V.N.6
-
5
-
-
80052401394
-
-
Bredeche, N.: http://www.lri.fr/~bredeche/roborobo/
-
-
-
Bredeche, N.1
-
6
-
-
85161994270
-
Active Preference Learning with Discrete Choice Data
-
Brochu, E., de Freitas, N., Ghosh, A.: Active Preference Learning with Discrete Choice Data. In: Proc. NIPS 20, pp. 409-416 (2008)
-
(2008)
Proc. NIPS 20
, pp. 409-416
-
-
Brochu, E.1
De Freitas, N.2
Ghosh, A.3
-
7
-
-
34047173490
-
On Learning, Representing and Generalizing a Task in a Humanoid Robot
-
Calinon, S., Guenter, F., Billard, A.: On Learning, Representing and Generalizing a Task in a Humanoid Robot. IEEE Trans. on Systems, Man and Cybernetics, Special Issue on Robot Learning by Observation, Demonstration and Imitation 37(2), 286-298 (2007)
-
(2007)
IEEE Trans. on Systems, Man and Cybernetics, Special Issue on Robot Learning by Observation, Demonstration and Imitation
, vol.37
, Issue.2
, pp. 286-298
-
-
Calinon, S.1
Guenter, F.2
Billard, A.3
-
8
-
-
17444409624
-
A Tutorial on the Cross-Entropy Method
-
de Boer, P.-T., Kroese, D.P., Mannor, S., Rubinstein, R.Y.: A Tutorial on the Cross-Entropy Method. Annals OR 134(1), 19-67 (2005)
-
(2005)
Annals or
, vol.134
, Issue.1
, pp. 19-67
-
-
De Boer, P.-T.1
Kroese, D.P.2
Mannor, S.3
Rubinstein, R.Y.4
-
9
-
-
55249109544
-
The Forgetron: A Kernel-Based Perceptron on a Budget
-
Dekel, O., Shalev-Shwartz, S., Singer, Y.: The Forgetron: A Kernel-Based Perceptron on a Budget. SIAM J. Comput. 37, 1342-1372 (2008)
-
(2008)
SIAM J. Comput.
, vol.37
, pp. 1342-1372
-
-
Dekel, O.1
Shalev-Shwartz, S.2
Singer, Y.3
-
10
-
-
0030649484
-
Solving the Multiple-Instance Problem with Axis-Parallel Rectangles
-
Dietterich, T.G., Lathrop, R., Lozano-Perez, T.: Solving the Multiple-Instance Problem with Axis-Parallel Rectangles. Artif. Intelligence 89(1-2), 31-71 (1997)
-
(1997)
Artif. Intelligence
, vol.89
, Issue.1-2
, pp. 31-71
-
-
Dietterich, T.G.1
Lathrop, R.2
Lozano-Perez, T.3
-
12
-
-
31844446804
-
A Support Vector Method for Multivariate Performance Measures
-
De Raedt, L., Wrobel, S. (eds.) ACM, New York
-
Joachims, T.: A Support Vector Method for Multivariate Performance Measures. In: De Raedt, L., Wrobel, S. (eds.) Proc. 22nd ICML. ACM Intl. Conf. Proc. Series, vol. 119, pp. 377-384. ACM, New York (2005)
-
(2005)
Proc. 22nd ICML. ACM Intl. Conf. Proc. Series
, vol.119
, pp. 377-384
-
-
Joachims, T.1
-
13
-
-
33749563073
-
Training Linear SVMs in Linear Time
-
Eliassi-Rad, T., et al. (eds.) ACM, New York
-
Joachims, T.: Training Linear SVMs in Linear Time. In: Eliassi-Rad, T., et al. (eds.) Proc. 12th Intl. Conf. KDDM, pp. 217-226. ACM, New York (2006)
-
(2006)
Proc. 12th Intl. Conf. KDDM
, pp. 217-226
-
-
Joachims, T.1
-
14
-
-
85042697846
-
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
-
MIT Press, Cambridge
-
Zico Kolter, J., Abbeel, P., Ng, A.Y.: Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion. In: Proc. NIPS 20. MIT Press, Cambridge (2007)
-
(2007)
Proc. NIPS 20
-
-
Zico Kolter, J.1
Abbeel, P.2
Ng, A.Y.3
-
15
-
-
84874610503
-
Exploiting Open-Endedness to Solve Problems Through the Search for Novelty
-
Lehman, J., Stanley, K.O.: Exploiting Open-Endedness to Solve Problems Through the Search for Novelty. In: Proc. Artificial Life XI, pp. 329-336 (2008)
-
(2008)
Proc. Artificial Life XI
, pp. 329-336
-
-
Lehman, J.1
Stanley, K.O.2
-
16
-
-
84874610503
-
Exploiting Open-Endedness to Solve Problems through the Search for Novelty
-
MIT Press, Cambridge
-
Lehman, J., Stanley, K.O.: Exploiting Open-Endedness to Solve Problems through the Search for Novelty. In: Proc. ALife 2008, MIT Press, Cambridge (2008)
-
(2008)
Proc. ALife 2008
-
-
Lehman, J.1
Stanley, K.O.2
-
17
-
-
85162484380
-
Feature Construction for Inverse Reinforcement Learning
-
Levine, S., Popovic, Z., Koltun, V.: Feature Construction for Inverse Reinforcement Learning. In: Proc. NIPS 23, pp. 1342-1350 (2010)
-
(2010)
Proc. NIPS
, vol.23
, pp. 1342-1350
-
-
Levine, S.1
Popovic, Z.2
Koltun, V.3
-
18
-
-
78650262449
-
Modeling and Optimization of Adaptive Foraging in Swarm Robotic Systems
-
Liu, W., Winfield, A.F.T.: Modeling and Optimization of Adaptive Foraging in Swarm Robotic Systems. Intl. J. Robotic Research 29(14), 1743-1760 (2010)
-
(2010)
Intl. J. Robotic Research
, vol.29
, Issue.14
, pp. 1743-1760
-
-
Liu, W.1
Winfield, A.F.T.2
-
19
-
-
0042547347
-
Algorithms for Inverse Reinforcement Learning
-
Langley, P. (ed.) Morgan Kaufmann, San Francisco
-
Ng, A.Y., Russell, S.: Algorithms for Inverse Reinforcement Learning. In: Langley, P. (ed.) Proc. 17th ICML, pp. 663-670. Morgan Kaufmann, San Francisco (2000)
-
(2000)
Proc. 17th ICML
, pp. 663-670
-
-
Ng, A.Y.1
Russell, S.2
-
20
-
-
44949241322
-
Reinforcement Learning of Motor Skills with Policy Gradients
-
Peters, J., Schaal, S.: Reinforcement Learning of Motor Skills with Policy Gradients. Neural Networks 21(4), 682-697 (2008)
-
(2008)
Neural Networks
, vol.21
, Issue.4
, pp. 682-697
-
-
Peters, J.1
Schaal, S.2
-
21
-
-
84864069017
-
Efficient Learning of Sparse Representations with an Energy-Based Model
-
Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) MIT Press, Cambridge
-
Ranzato, M.-A., Poultney, C.S., Chopra, S., LeCun, Y.: Efficient Learning of Sparse Representations with an Energy-Based Model. In: Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) Proc. NIPS 19, pp. 1137-1144. MIT Press, Cambridge (2006)
-
(2006)
Proc. NIPS
, vol.19
, pp. 1137-1144
-
-
Ranzato, M.-A.1
Poultney, C.S.2
Chopra, S.3
LeCun, Y.4
-
24
-
-
77954086905
-
Energy-efficient Indoor Search by Swarms of Simulated Flying Robots without Global Information
-
Stirling, T.S., Wischmann, S., Floreano, D.: Energy-efficient Indoor Search by Swarms of Simulated Flying Robots without Global Information. Swarm Intelligence 4(2), 117-143 (2010)
-
(2010)
Swarm Intelligence
, vol.4
, Issue.2
, pp. 117-143
-
-
Stirling, T.S.1
Wischmann, S.2
Floreano, D.3
-
25
-
-
33749255382
-
PAC Model-free Reinforcement Learning
-
Airoldi, E.M., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. Springer, Heidelberg
-
Strehl, A.L., Li, L., Wiewiora, E., Langford, J., Littman, M.L.: PAC Model-free Reinforcement Learning. In: Airoldi, E.M., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. LNCS, vol. 4503, pp. 881-888. Springer, Heidelberg (2007)
-
(2007)
LNCS
, vol.4503
, pp. 881-888
-
-
Strehl, A.L.1
Li, L.2
Wiewiora, E.3
Langford, J.4
Littman, M.L.5
-
27
-
-
85162012324
-
A Game-Theoretic Approach to Apprenticeship Learning
-
MIT Press, Cambridge
-
Syed, U., Schapire, R.: A Game-Theoretic Approach to Apprenticeship Learning. In: Proc. NIPS 21, pp. 1449-1456. MIT Press, Cambridge (2008)
-
(2008)
Proc. NIPS
, vol.21
, pp. 1449-1456
-
-
Syed, U.1
Schapire, R.2
-
28
-
-
70350150649
-
Improvements on Learning Tetris with Cross Entropy
-
Thiery, C., Scherrer, B.: Improvements on Learning Tetris with Cross Entropy. ICGA Journal 32(1), 23-33 (2009)
-
(2009)
ICGA Journal
, vol.32
, Issue.1
, pp. 23-33
-
-
Thiery, C.1
Scherrer, B.2
-
29
-
-
30944441110
-
Cooperative Hole Avoidance in a Swarm-bot
-
Trianni, V., Nolfi, S., Dorigo, M.: Cooperative Hole Avoidance in a Swarm-bot. Robotics and Autonomous Systems 54(2), 97-103 (2006)
-
(2006)
Robotics and Autonomous Systems
, vol.54
, Issue.2
, pp. 97-103
-
-
Trianni, V.1
Nolfi, S.2
Dorigo, M.3
|