-
1
-
-
0016556021
-
A new approach to manipulator control: The cerebellar model articulation controller
-
0314.92007 10.1115/1.3426922
-
Albus, J. S. (1975). A new approach to manipulator control: the cerebellar model articulation controller. Journal of Dynamic Systems, Measurement, and Control, 97(3), 220-227.
-
(1975)
Journal of Dynamic Systems, Measurement, and Control
, vol.97
, Issue.3
, pp. 220-227
-
-
Albus, J.S.1
-
2
-
-
78649507911
-
A Bayesian sampling approach to exploration in reinforcement learning
-
Asmuth, J., Li, L., Littman, M., Nouri, A., & Wingate, D. (2009). A Bayesian sampling approach to exploration in reinforcement learning. In Proceedings of the 25th conference on uncertainty in artificial intelligence (UAI).
-
(2009)
Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI)
-
-
Asmuth, J.1
Li, L.2
Littman, M.3
Nouri, A.4
Wingate, D.5
-
3
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
1012.68093 10.1023/A:1013689704352
-
Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2), 235-256.
-
(2002)
Machine Learning
, vol.47
, Issue.2
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
10.1016/0004-3702(94)00011-O
-
Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1-2), 81-138.
-
(1995)
Artificial Intelligence
, vol.72
, Issue.1-2
, pp. 81-138
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
5
-
-
84973495235
-
Multiagent interactions in urban driving
-
Beeson, P., O'Quin, J., Gillan, B., Nimmagadda, T., Ristroph, M., Li, D., & Stone, P. (2008). Multiagent interactions in urban driving. Journal of Physical Agents, 2(1), 15-30.
-
(2008)
Journal of Physical Agents
, vol.2
, Issue.1
, pp. 15-30
-
-
Beeson, P.1
O'Quin, J.2
Gillan, B.3
Nimmagadda, T.4
Ristroph, M.5
Li, D.6
Stone, P.7
-
7
-
-
0035478854
-
Random forests
-
1007.68152 10.1023/A:1010933404324
-
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32.
-
(2001)
Machine Learning
, vol.45
, Issue.1
, pp. 5-32
-
-
Breiman, L.1
-
16
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
2249830 1222.68193
-
Ernst, D., Geurts, P., & Wehenkel, L. (2005). Tree-based batch mode reinforcement learning. Journal of Machine Learning Research, 6, 503-556.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
17
-
-
78149267893
-
Intrinsically motivated information foraging
-
Fasel, I., Wilt, A., Mafi, N., & Morris, C. (2010). Intrinsically motivated information foraging. In Proceedings of the ninth international conference on development and learning (ICDL).
-
(2010)
Proceedings of the Ninth International Conference on Development and Learning (ICDL)
-
-
Fasel, I.1
Wilt, A.2
Mafi, N.3
Morris, C.4
-
19
-
-
55849142990
-
The parallelization of Monte-Carlo planning
-
Gelly, S., Hoock, J. B., Rimmel, A., Teytaud, O., & Kalemkarian, Y. (2008). The parallelization of Monte-Carlo planning. In Proceedings of the fifth international conference on informatics in control, automation and robotics, intelligent control systems and optimization (ICINCO 2008) (pp. 244-249).
-
(2008)
Proceedings of the Fifth International Conference on Informatics in Control, Automation and Robotics, Intelligent Control Systems and Optimization (ICINCO 2008)
, pp. 244-249
-
-
Gelly, S.1
Hoock, J.B.2
Rimmel, A.3
Teytaud, O.4
Kalemkarian, Y.5
-
26
-
-
0037399236
-
Markov decision processes with delays and asynchronous cost collection
-
1968039 10.1109/TAC.2003.809799
-
Katsikopoulos, K., & Engelbrecht, S. (2003). Markov decision processes with delays and asynchronous cost collection. IEEE Transactions on Automatic Control, 48(4), 568-574.
-
(2003)
IEEE Transactions on Automatic Control
, vol.48
, Issue.4
, pp. 568-574
-
-
Katsikopoulos, K.1
Engelbrecht, S.2
-
28
-
-
78049390740
-
Policy search for motor primitives in robotics
-
1237.68229 10.1007/s10994-010-5223-6
-
Kober, J., & Peters, J. (2011). Policy search for motor primitives in robotics. Machine Learning, 84(1-2), 171-203.
-
(2011)
Machine Learning
, vol.84
, Issue.1-2
, pp. 171-203
-
-
Kober, J.1
Peters, J.2
-
31
-
-
84868298260
-
LRTDP versus UCT for online probabilistic planning
-
Kolobov, A., Mausam, & Weld, D. (2012). LRTDP versus UCT for online probabilistic planning. In AAAI conference on artificial intelligence. https://www.aaai.org/ocs/index.php/AAAI/AAAI12/paper/view/4961/5334.
-
(2012)
AAAI Conference on Artificial Intelligence
-
-
Kolobov, A.1
Mausam2
Weld, D.3
-
37
-
-
84855817203
-
A parallel general game player
-
10.1007/s13218-010-0083-6
-
Méhat, J., & Cazenave, T. (2011). A parallel general game player. KI. Künstliche Intelligenz, 25(1), 43-47.
-
(2011)
KI. Künstliche Intelligenz
, vol.25
, Issue.1
, pp. 43-47
-
-
Méhat, J.1
Cazenave, T.2
-
38
-
-
0036832953
-
Variable resolution discretization in optimal control
-
1005.68086 10.1023/A:1017992615625
-
Munos, R., & Moore, A. (2002). Variable resolution discretization in optimal control. Machine Learning, 49, 291-323.
-
(2002)
Machine Learning
, vol.49
, pp. 291-323
-
-
Munos, R.1
Moore, A.2
-
40
-
-
0037383659
-
What the cerebellum computes
-
10.1016/S0166-2236(03)00054-7
-
Ohyama, T., Nores, W. L., Murphy, M., & Mauk, M. D. (2003). What the cerebellum computes. Trends in Neurosciences, 26(4), 222-227.
-
(2003)
Trends in Neurosciences
, vol.26
, Issue.4
, pp. 222-227
-
-
Ohyama, T.1
Nores, W.L.2
Murphy, M.3
Mauk, M.D.4
-
41
-
-
34047267520
-
Intrinsic motivation systems for autonomous mental development
-
10.1109/TEVC.2006.890271
-
Oudeyer, P. Y., Kaplan, F., & Hafner, V. V. (2007). Intrinsic motivation systems for autonomous mental development. IEEE Transactions on Evolutionary Computation, 11(2), 265-286.
-
(2007)
IEEE Transactions on Evolutionary Computation
, vol.11
, Issue.2
, pp. 265-286
-
-
Oudeyer, P.Y.1
Kaplan, F.2
Hafner, V.V.3
-
42
-
-
33749251297
-
An analytic solution to discrete Bayesian reinforcement learning
-
Poupart, P., Vlassis, N., Hoey, J., & Regan, K. (2006). An analytic solution to discrete Bayesian reinforcement learning. In Proceedings of the twenty-third international conference on machine learning (ICML) (pp. 697-704).
-
(2006)
Proceedings of the Twenty-third International Conference on Machine Learning (ICML)
, pp. 697-704
-
-
Poupart, P.1
Vlassis, N.2
Hoey, J.3
Regan, K.4
-
43
-
-
77957352104
-
ROS: An open-source robot operating system
-
Quigley, M., Conley, K., Gerkey, B., Faust, J., Foote, T., Leibs, J., Wheeler, R., & Ng, A. (2009). ROS: an open-source robot operating system. In ICRA workshop on open source software.
-
(2009)
ICRA Workshop on Open Source Software
-
-
Quigley, M.1
Conley, K.2
Gerkey, B.3
Faust, J.4
Foote, T.5
Leibs, J.6
Wheeler, R.7
Ng, A.8
-
44
-
-
33744584654
-
Induction of decision trees
-
Quinlan, R. (1986). Induction of decision trees. Machine Learning, 1, 81-106.
-
(1986)
Machine Learning
, vol.1
, pp. 81-106
-
-
Quinlan, R.1
-
46
-
-
78651479757
-
Control delay in reinforcement learning for real-time dynamic systems: A memoryless approach
-
Schuitema, E., Busoniu, L., Babuska, R., & Jonker, P. (2010). Control delay in reinforcement learning for real-time dynamic systems: a memoryless approach. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 3226-3231).
-
(2010)
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
, pp. 3226-3231
-
-
Schuitema, E.1
Busoniu, L.2
Babuska, R.3
Jonker, P.4
-
48
-
-
84863416482
-
Temporal difference search in computer Go
-
Silver, D., Sutton, R., & Muller, M. (2012). Temporal difference search in computer go. Machine Learning, 87
-
(2012)
Machine Learning
, vol.87
-
-
Silver, D.1
Sutton, R.2
Muller, M.3
-
53
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Sutton, R. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the seventh international conference on machine learning (ICML) (pp. 216-224).
-
(1990)
Proceedings of the Seventh International Conference on Machine Learning (ICML)
, pp. 216-224
-
-
Sutton, R.1
-
55
-
-
84899464022
-
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
-
Sutton, R., Modayil, J., Delp, M., Degris, T., Pilarski, P., White, A., & Precup, D. (2011). Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. In Proceedings of the tenth international joint conference on autonomous agents and multiagent systems (AAMAS).
-
(2011)
Proceedings of the Tenth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS)
-
-
Sutton, R.1
Modayil, J.2
Delp, M.3
Degris, T.4
Pilarski, P.5
White, A.6
Precup, D.7
-
56
-
-
70449370276
-
RL-Glue: Language-independent software for reinforcement-learning experiments
-
Tanner, B., & White, A. (2009). RL-Glue: language-independent software for reinforcement-learning experiments. Journal of Machine Learning Research, 10, 2133-2136.
-
(2009)
Journal of Machine Learning Research
, vol.10
, pp. 2133-2136
-
-
Tanner, B.1
White, A.2
-
57
-
-
79956344726
-
A Monte-Carlo AIXI approximation
-
2805239 1214.68302
-
Veness, J., Ng, K. S., Hutter, M., Uther, W. T. B., & Silver, D. (2011). A Monte-Carlo AIXI approximation. The Journal of Artificial Intelligence Research, 40, 95-142.
-
(2011)
The Journal of Artificial Intelligence Research
, vol.40
, pp. 95-142
-
-
Veness, J.1
Ng, K.S.2
Hutter, M.3
Uther, W.T.B.4
Silver, D.5
-
58
-
-
58049186782
-
Learning and planning in environments with delayed feedback
-
10.1007/s10458-008-9056-7
-
Walsh, T., Nouri, A., Li, L., & Littman, M. (2009a). Learning and planning in environments with delayed feedback. Autonomous Agents and Multi-Agent Systems, 18, 83-105.
-
(2009)
Autonomous Agents and Multi-Agent Systems
, vol.18
, pp. 83-105
-
-
Walsh, T.1
Nouri, A.2
Li, L.3
Littman, M.4
-
59
-
-
79958846996
-
Exploring compact reinforcement-learning representations with linear regression
-
Walsh, T., Szita, I., Diuk, C., & Littman, M. (2009b). Exploring compact reinforcement-learning representations with linear regression. In Proceedings of the 25th conference on uncertainty in artificial intelligence (UAI).
-
(2009)
Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI)
-
-
Walsh, T.1
Szita, I.2
Diuk, C.3
Littman, M.4
-
61
-
-
31844436266
-
Bayesian sparse sampling for on-line reward optimization
-
10.1145/1102351.1102472
-
Wang, T., Lizotte, D., Bowling, M., & Schuurmans, D. (2005). Bayesian sparse sampling for on-line reward optimization. In Proceedings of the twenty-second international conference on machine learning (ICML) (pp. 956-963).
-
(2005)
Proceedings of the Twenty-second International Conference on Machine Learning (ICML)
, pp. 956-963
-
-
Wang, T.1
Lizotte, D.2
Bowling, M.3
Schuurmans, D.4
-
64
-
-
0029307102
-
The context tree weighting method: Basic properties
-
0837.94011 10.1109/18.382012
-
Willems, F. M. J., Shtarkov, Y. M., & Tjalkens, T. J. (1995). The context tree weighting method: basic properties. IEEE Transactions on Information Theory, 41, 653-664.
-
(1995)
IEEE Transactions on Information Theory
, vol.41
, pp. 653-664
-
-
Willems, F.M.J.1
Shtarkov, Y.M.2
Tjalkens, T.J.3