-
2
-
-
0029537980
-
Issues and approaches in the design of collective autonomous a gents
-
Dec
-
M. Mataric, "Issues and approaches in the design of collective autonomous a gents," Robotics and Autonoumous Systems, vol. 16, pp. 321-331, Dec 1995.
-
(1995)
Robotics and Autonoumous Systems
, vol.16
, pp. 321-331
-
-
Mataric, M.1
-
3
-
-
84957702766
-
Multiobjective hybrid control synthisis
-
Proceedings of hybrid and realtime systems, Grenoble: Springer-Verlag, March
-
J.Lygeros, C.J.Tomlin, and S.Sastry, "Multiobjective hybrid control synthisis," in Proceedings of hybrid and realtime systems, vol. 1201 of Lecture Notes in Computer Science, Grenoble: Springer-Verlag, March 1997.
-
(1997)
Lecture Notes in Computer Science
, vol.1201
-
-
Lygeros, J.1
Tomlin, C.J.2
Sastry, S.3
-
4
-
-
0033311181
-
Basic problems in stability and design of switched systems
-
Oct.
-
D. Liberzon and A. S. Morse, "Basic problems in stability and design of switched systems," IEEE Control Systems, vol. 19, pp. 59-70, Oct. 1999.
-
(1999)
IEEE Control Systems
, vol.19
, pp. 59-70
-
-
Liberzon, D.1
Morse, A.S.2
-
5
-
-
0003672832
-
-
PhD thesis, MIT, Cambridge, MA
-
M. Branicky, Studies in Hybrid Systems: Modeling, Analysis and Control. PhD thesis, MIT, Cambridge, MA, 1995.
-
(1995)
Studies in Hybrid Systems: Modeling, Analysis and Control
-
-
Branicky, M.1
-
7
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
8
-
-
0033348437
-
Representation of behavioral history for learning in nonstationary conditions
-
F. Michaud and M. J. Mataric, "Representation of behavioral history for learning in nonstationary conditions," Robotics and Autonomous Systems, vol. 29, no. 2, pp. 187-200, 1999.
-
(1999)
Robotics and Autonomous Systems
, vol.29
, Issue.2
, pp. 187-200
-
-
Michaud, F.1
Mataric, M.J.2
-
9
-
-
0001898381
-
Practical reinforcement learning in continuous spaces
-
Morgan Kaufmann, June 29 - July 2
-
W. D. Smart and L. P. Kaelbling, "Practical reinforcement learning in continuous spaces," in Proceedings of the Seventeenth International Conference on Machine Learning, vol. 17, pp. 903-910, Morgan Kaufmann, June 29 - July 2 2000.
-
(2000)
Proceedings of the Seventeenth International Conference on Machine Learning
, vol.17
, pp. 903-910
-
-
Smart, W.D.1
Kaelbling, L.P.2
-
10
-
-
0348132949
-
Enhancing transfer in reinforcement learning by building stochastic models of robot actions
-
Morgan Kaufmann
-
S. Mahadevan, "Enhancing transfer in reinforcement learning by building stochastic models of robot actions," in Proceedings of the Ninth International Conference on Machine Learning, vol. 9, pp. 290-299, Morgan Kaufmann, 1992.
-
(1992)
Proceedings of the Ninth International Conference on Machine Learning
, vol.9
, pp. 290-299
-
-
Mahadevan, S.1
-
11
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L. J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning, vol. 8, pp. 293-321, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.J.1
-
12
-
-
0030149709
-
Purposive behaviour aquisition for a real robot by vision-based reinforcement learning
-
M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behaviour aquisition for a real robot by vision-based reinforcement learning," Machine Learning, vol. 23, pp. 279-303, 1996.
-
(1996)
Machine Learning
, vol.23
, pp. 279-303
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
13
-
-
0036058423
-
Effective reinforcement learning for mobile robots
-
IEEE Intl. Conf. on Robot, and Automat., 2002
-
W. D. Smart and L. P. Kaelbling, "Effective reinforcement learning for mobile robots," in IEEE Int. Conf. on Robotics and Automation, ICRA 02, 2002. IEEE Intl. Conf. on Robot, and Automat., 2002.
-
(2002)
IEEE Int. Conf. on Robotics and Automation, ICRA 02
-
-
Smart, W.D.1
Kaelbling, L.P.2
-
17
-
-
0001794302
-
Localizing search in reinforcement learning
-
Menlo park, CA: AAAI Press / Cambridge, MA: MIT Press, July 30 - August 3
-
G. Z. Grudic and L. H. Ungar, "Localizing search in reinforcement learning," in Proceedings of the Seventeenth National Conference on Artificial Intelligence, vol. 17, pp. 590-595, Menlo park, CA: AAAI Press / Cambridge, MA: MIT Press, July 30 - August 3 2000.
-
(2000)
Proceedings of the Seventeenth National Conference on Artificial Intelligence
, vol.17
, pp. 590-595
-
-
Grudic, G.Z.1
Ungar, L.H.2
-
18
-
-
84898958374
-
Gradient descent for general reinforcement learning
-
M. I. Jordan, M. J. Kearns, and S. A. Solla, eds., Cambridge, MA, MIT Press
-
L. Baird and A. W. Moore, "Gradient descent for general reinforcement learning," in Advances in Neural Information Processing Systems (M. I. Jordan, M. J. Kearns, and S. A. Solla, eds.), vol. 11, (Cambridge, MA), MIT Press, 1999.
-
(1999)
Advances in Neural Information Processing Systems
, vol.11
-
-
Baird, L.1
Moore, A.W.2
-
19
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
R. J. Williams, "Simple statistical gradient-following algorithms for connectionist reinforcement learning," Machine Learning, vol. 8, no. 3, pp. 229-256, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3
, pp. 229-256
-
-
Williams, R.J.1
|