Andre, D. and Russell, S. (2000). Programmable reinforcement learning agents. Advances in Neural Information Processing Systems, Leen, T. K., Dietterich, T. G. and Tresp, V. (eds). Cambridge, MA, MIT Press.
Bagnell, J.A. et al. (2004). Policy search by dynamic programming. Advances in Neural Information Processing Systems, Vol. 16, Thrun, S., Saul, L. K. and Schölkopf, B. (eds). Cambridge, MA, MIT Press.
Butler, Z. et al. (2004). Generic distributed control for locomotion with self-reconfiguring robots. International Journal of Robotics Research, 23(9), 919-938.
Chang, Y.-H., Ho, T. and Kaelbling, L.P. (2004). All learning is local: multi-agent learning in global reward games. Advances in Neural Information Processing Systems, Vol. 16, Thrun, S., Saul, L. K. and Schölkopf, B. (eds). Cambridge, MA, MIT Press.
Guestrin, C., Koller, D. and Parr, R. (2002). Multiagent planning with factored MDPs. Advances in Neural Information Processing Systems, Vol. 14, Dietterich, T. G., Becker, S. and Ghahramani, Z. (eds). Cambridge, MA, MIT Press.
Kamimura, A. et al. (2004). Distributed adaptive locomotion by a modular robotic system, M-TRAN II - from local adaptation to global coordinated motion using CPG controllers. Proceedings of the International Conference on Intelligent Robots and Systems, Sendai, Japan.
Kok, J.R. and Vlassis, N. (2006). Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research, 7, 1789-1828.
Mataric, M.J. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1), 73-83.
Shoham, Y., Powers, R. and Grenager, T. (2003). Multi-agent reinforcement learning: a critical survey. Technical Report, Palo Alto, CA, Stanford University.
Stone, P. and Veloso, M.M. (2000). Multiagent systems: a survey from a machine learning perspective. Autonomous Robots, 8(3), 345-383.
Sutton, R.S. (1995). Generalization in reinforcement learning: successful examples using sparse coarse coding. Advances in Neural Information Processing Systems, Touretzky, D. S., Mozer, M. C. and Hasselmo, M. E. (eds). Cambridge, MA, MIT Press, pp. 1038-1044.
Sutton, R.S. et al. (2000). Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems, Leen, T. K., Dietterich, T. G. and Tresp, V. (eds), Vol. 12. Cambridge, MA, MIT Press.
Yu, W. et al. (2002). Using interaction-based learning to construct an adaptive and fault-tolerant multi-link floating robot. Proceedings of the International Workshop on Distributed Autonomous Robotic Systems (DARS), Vol. 5, Asama, H., Arai, T., Fukuda, T. and Hasegawa, T. (eds). Berlin, Springer, pp. 455-464.
Zykov, V. et al. (2005). Self-reproducing machines. Nature, 435(7038), 163-164.