-
1
-
-
0041965975
-
R-max - A general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, Ronen I. and Tennenholtz, Moshe. R-max - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3:213-231, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 213-231
-
-
Brafman, R.I.1
Tennenholtz, M.2
-
2
-
-
34250766214
-
Learning the structure of Factored Markov Decision Processes in reinforcement learning problems
-
Degris, Thomas, Sigaud, Olivier, and Wuillemin, Pierre-Henri. Learning the structure of Factored Markov Decision Processes in reinforcement learning problems. In Proceedings of the 23rd International Conference on Machine learning' 06, pp. 257-264.
-
Proceedings of the 23rd International Conference on Machine Learning' 06
, pp. 257-264
-
-
Degris, T.1
Sigaud, O.2
Wuillemin, P.-H.3
-
3
-
-
71149108881
-
The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning
-
Diuk, Carlos, Li, Lihong, and Leffler, Bethany R. The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning. In Proceedings of the 26th International Conference on Machine Learning'09, pp. 249-256.
-
Proceedings of the 26th International Conference on Machine Learning'09
, pp. 249-256
-
-
Diuk, C.1
Li, L.2
Leffler, B.R.3
-
4
-
-
33749245414
-
Algorithm-directed exploration for model-based reinforcement learning in Factored MDPs
-
Guestrin, Carlos, Patrascu, Relu, and Schuurmans, Dale. Algorithm-directed exploration for model-based reinforcement learning in Factored MDPs. In Proceedings of the 19th International Conference on Machine Learning'02, pp. 235-242.
-
Proceedings of the 19th International Conference on Machine Learning'02
, pp. 235-242
-
-
Guestrin, C.1
Patrascu, R.2
Schuurmans, D.3
-
5
-
-
4544318426
-
Efficient solution algorithms for Factored MDPs
-
Guestrin, Carlos, Koller, Daphne, Parr, Ronald, and Venkataraman, Shobha. Efficient solution algorithms for Factored MDPs. Journal of the Artificial Intelligence Research, pp. 399-468, 2003.
-
(2003)
Journal of the Artificial Intelligence Research
, pp. 399-468
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
Venkataraman, S.4
-
6
-
-
0036832954
-
Near-optimal reinforcement learning in polynomial time
-
Kearns, Michael and Singh, Satinder. Near-optimal reinforcement learning in polynomial time. Machine Learning, pp. 209-232, 2002.
-
(2002)
Machine Learning
, pp. 209-232
-
-
Kearns, M.1
Singh, S.2
-
8
-
-
36348930987
-
Efficient structure learning in Factored-State MDPs
-
Strehl, Alexander L., Diuk, Carlos, and Littman, Michael L. Efficient structure learning in Factored-State MDPs. In Proceedings of the 22nd National Conference on Artificial intelligence, pp. 645-650.
-
Proceedings of the 22nd National Conference on Artificial Intelligence
, pp. 645-650
-
-
Strehl, A.L.1
Diuk, C.2
Littman, M.L.3
|