2. Joseph, J., Geramifard, A., Roberts, J. W., How, J. P. & Roy, N. (2013), Reinforcement learning with misspecified model classes, in '2013 IEEE International Conference on Robotics and Automation (ICRA)', pp. 939-946.
3. Kearns, M., Mansour, Y. & Ng, A. Y. (2002), 'A sparse sampling algorithm for near-optimal planning in large Markov decision processes', Machine Learning 49(2-3), 193-208.
5. Lin, L.-J. (1992), 'Self-improving reactive agents based on reinforcement learning, planning and teaching', Machine Learning 8(3-4), 293-321.
6. van der Maaten, L., Chen, M., Tyree, S. & Weinberger, K. Q. (2013), Learning with marginalized corrupted features, in 'Proceedings of the 30th International Conference on Machine Learning (ICML-13)', pp. 410-418.
7. Matsuoka, K. (1992), 'Noise injection into inputs in backpropagation learning', IEEE Transactions on Systems, Man and Cybernetics 22(3), 436-440.
9. Rumelhart, D. E., Hinton, G. E. & Williams, R. J. (1986), 'Learning representations by back-propagating errors', Nature 323(6088), 533-536.
10. Siddiqi, S. M., Boots, B. & Gordon, G. J. (2007), A constraint generation approach to learning stable linear dynamical systems, in 'Advances in Neural Information Processing Systems (NIPS)', pp. 1329-1336.
12. Sorg, J., Lewis, R. L. & Singh, S. (2010), Reward design via online gradient ascent, in 'Advances in Neural Information Processing Systems (NIPS)', pp. 2190-2198.
13. Sorg, J., Singh, S. & Lewis, R. L. (2010), Internal rewards mitigate agent boundedness, in 'Proceedings of the 27th International Conference on Machine Learning (ICML-10)', pp. 1007-1014.
14. Veness, J., Ng, K. S., Hutter, M., Uther, W. T. B. & Silver, D. (2011), 'A Monte-Carlo AIXI approximation', Journal of Artificial Intelligence Research 40, 95-142.
15. Willems, F. M., Shtarkov, Y. M. & Tjalkens, T. J. (1995), 'The context tree weighting method: Basic properties', IEEE Transactions on Information Theory 41, 653-664.