-
1
-
-
0020970738
-
Neuron-like elements that can solve difficult learning control problems
-
Barto, A.G., Sutton, R.S., Anderson, C.: Neuron-like elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics 13, 835-846 (1983)
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.13
, pp. 835-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.3
-
2
-
-
70349984547
-
Natural actor-critic algorithms
-
Bhatnagar, S., Sutton, R.S., Ghavamzadeh, M., Lee, M.: Natural actor-critic algorithms. Automatica 45(11), 2471-2482 (2009)
-
(2009)
Automatica
, vol.45
, Issue.11
, pp. 2471-2482
-
-
Bhatnagar, S.1
Sutton, R.S.2
Ghavamzadeh, M.3
Lee, M.4
-
3
-
-
48349140736
-
Rollout sampling approximate policy iteration
-
Dimitrakakis, C., Lagoudakis, M.G.: Rollout sampling approximate policy iteration. Machine Learning 72(3), 157-171 (2008)
-
(2008)
Machine Learning
, vol.72
, Issue.3
, pp. 157-171
-
-
Dimitrakakis, C.1
Lagoudakis, M.G.2
-
4
-
-
33744466799
-
Approximate policy iteration with a policy language bias: Solving relational Markov decision processes
-
Fern, A., Yoon, S.W., Givan, R.: Approximate policy iteration with a policy language bias: Solving relational Markov decision processes. Journal of Artificial Intelligence Research 25, 75-118 (2006)
-
(2006)
Journal of Artificial Intelligence Research
, vol.25
, pp. 75-118
-
-
Fern, A.1
Yoon, S.W.2
Givan, R.3
-
6
-
-
84865254474
-
Rollout allocation strategies for classification-based policy iteration
-
Auer, P., Kaski, S., Szepesvári, C. (eds.)
-
Gabillon, V., Lazaric, A., Ghavamzadeh, M.: Rollout allocation strategies for classification-based policy iteration. In: Auer, P., Kaski, S., Szepesvári, C. (eds.) Proceedings of the ICML 2010 Workshop on Reinforcement Learning and Search in Very Large Spaces (2010)
-
Proceedings of the ICML 2010 Workshop on Reinforcement Learning and Search in Very Large Spaces (2010)
-
-
Gabillon, V.1
Lazaric, A.2
Ghavamzadeh, M.3
-
7
-
-
76749092270
-
The WEKA data mining software: An update
-
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software: An update. SIGKDD Explorations 11(1), 10-18 (2009)
-
(2009)
SIGKDD Explorations
, vol.11
, Issue.1
, pp. 10-18
-
-
Hall, M.1
Frank, E.2
Holmes, G.3
Pfahringer, B.4
Reutemann, P.5
Witten, I.6
-
8
-
-
52949143827
-
Label ranking by learning pairwise preferences
-
Hüllermeier, E., Fürnkranz, J., Cheng, W., Brinker, K.: Label ranking by learning pairwise preferences. Artificial Intelligence 172, 1897-1916 (2008)
-
(2008)
Artificial Intelligence
, vol.172
, pp. 1897-1916
-
-
Hüllermeier, E.1
Fürnkranz, J.2
Cheng, W.3
Brinker, K.4
-
9
-
-
56449088242
-
Non-parametric policy gradients: A unified treatment of propositional and relational domains
-
Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) ACM, Helsinki
-
Kersting, K., Driessens, K.: Non-parametric policy gradients: A unified treatment of propositional and relational domains. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the 25th International Conference on Machine Learning (ICML 2008), pp. 456-463. ACM, Helsinki (2008)
-
(2008)
Proceedings of the 25th International Conference on Machine Learning (ICML 2008)
, pp. 456-463
-
-
Kersting, K.1
Driessens, K.2
-
11
-
-
1942420814
-
Reinforcement learning as classification: Leveraging modern classifiers
-
Fawcett, T.E., Mishra, N. (eds.) AAAI Press, Washington, DC
-
Lagoudakis, M.G., Parr, R.: Reinforcement learning as classification: Leveraging modern classifiers. In: Fawcett, T.E., Mishra, N. (eds.) Proceedings of the 20th International Conference on Machine Learning (ICML 2003), pp. 424-431. AAAI Press, Washington, DC (2003)
-
(2003)
Proceedings of the 20th International Conference on Machine Learning (ICML 2003)
, pp. 424-431
-
-
Lagoudakis, M.G.1
Parr, R.2
-
12
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R.S.: Learning to predict by the methods of temporal differences. Machine Learning 3, 9-44 (1988)
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
13
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Solla, S.A., Leen, T.K., Müller, K.-R. (eds.) MIT Press, Denver
-
Sutton, R.S., McAllester, D.A., Singh, S.P., Mansour, Y.: Policy gradient methods for reinforcement learning with function approximation. In: Solla, S.A., Leen, T.K., Müller, K.-R. (eds.) Advances in Neural Information Processing Systems 12 (NIPS 1999), pp. 1057-1063. MIT Press, Denver (1999)
-
(1999)
Advances in Neural Information Processing Systems 12 (NIPS 1999)
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.A.2
Singh, S.P.3
Mansour, Y.4
-
14
-
-
84890217212
-
Label ranking algorithms: A survey
-
Fürnkranz and Hüllermeier
-
Vembu, S., Gärtner, T.: Label ranking algorithms: A survey. In: Fürnkranz and Hüllermeier [5], pp. 45-64.
-
Preference Learning
, pp. 45-64
-
-
Vembu, S.1
Gärtner, T.2
-
16
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8, 229-256 (1992)
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1
-
17
-
-
70449449564
-
Reinforcement learning design for cancer clinical trials
-
Zhao, Y., Kosorok, M., Zeng, D.: Reinforcement learning design for cancer clinical trials. Statistics in Medicine 28, 3295-3315 (2009)
-
(2009)
Statistics in Medicine
, vol.28
, pp. 3295-3315
-
-
Zhao, Y.1
Kosorok, M.2
Zeng, D.3