-
3
-
-
84898958374
-
Gradient descent for general reinforcement learning
-
Cambridge, MA. The MIT Press
-
L. Baird and A. Moore. Gradient descent for general reinforcement learning. In Advances in Neural Information Processing Systems (NIPS), pages 968-974, Cambridge, MA, 1999. The MIT Press.
-
(1999)
Advances in Neural Information Processing Systems (NIPS)
, pp. 968-974
-
-
Baird, L.1
Moore, A.2
-
4
-
-
0345833118
-
Visualization methods for neural networks
-
The Hague, Netherlands
-
Horst Bishof, Axel Pinz, and Walter G. Kropatsch. Visualization methods for neural networks. In 11th International Conference on Pattern Recognition, pages 581-585, The Hague, Netherlands, 1992.
-
(1992)
11th International Conference on Pattern Recognition
, pp. 581-585
-
-
Bishof, H.1
Pinz, A.2
Kropatsch, W.G.3
-
6
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, MIT Press
-
R. H. Crites and A. G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems - 8, pages 1017-1023. MIT Press, 1996.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1017-1023
-
-
Crites, R.H.1
Barto, A.G.2
-
9
-
-
0024732792
-
Connectionist learning procedures
-
G. Hinton. Connectionist learning procedures. Artificial Intelligence, 40:185-234, 1986.
-
(1986)
Artificial Intelligence
, vol.40
, pp. 185-234
-
-
Hinton, G.1
-
10
-
-
4544307742
-
Simulation and visualization of a market-based model for logistics management in transportation
-
New York, NY, July
-
P. Hoen and H. La Poutre G. Redekar, V. Robu. Simulation and visualization of a market-based model for logistics management in transportation. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multi-Agent Systems, pages 1218-1219, New York, NY, July 2004.
-
(2004)
Proceedings of the Third International Joint Conference on Autonomous Agents and Multi-agent Systems
, pp. 1218-1219
-
-
Hoen, P.1
La Poutre, H.2
Redekar, G.3
Robu, V.4
-
12
-
-
0002218603
-
Coordination and learning in multi-robot systems
-
March
-
Maja J Mataric. Coordination and learning in multi-robot systems. In IEEE Intelligent Systems, pages 6-8, March 1998.
-
(1998)
IEEE Intelligent Systems
, pp. 6-8
-
-
Mataric, M.J.1
-
14
-
-
51649111071
-
Designing agent utilities for coordinated, scalable and robust multi-agent systems
-
P. Scerri, R. Mailler, and R. Vincent, editors, Springer. to appear
-
K. Tumer. Designing agent utilities for coordinated, scalable and robust multi-agent systems. In P. Scerri, R. Mailler, and R. Vincent, editors, Challenges in the Coordination of Large Scale Multiagent Sy stems. Springer, 2005. to appear.
-
(2005)
Challenges in the Coordination of Large Scale Multiagent Sy Stems
-
-
Tumer, K.1
-
15
-
-
0036355687
-
Learning sequences of actions in collectives of autonomous agents
-
Bologna, Italy, July
-
K. Tumer, A. Agogino, and D. Wolpert. Learning sequences of actions in collectives of autonomous agents. In Proceedings of the First International Joint Conference on Autonomous Agents and Multi-Agent Systems, pages 378-385, Bologna, Italy, July 2002.
-
(2002)
Proceedings of the First International Joint Conference on Autonomous Agents and Multi-agent Systems
, pp. 378-385
-
-
Tumer, K.1
Agogino, A.2
Wolpert, D.3
-
20
-
-
0033705642
-
Adaptivity in agent-based routing for data networks
-
D. H. Wolpert, S. Kirshner, C. J. Merz, and K. Tumer. Adaptivity in agent-based routing for data networks. In Proceedings of the fourth International Conference of Autonomous Agents, pages 396-403, 2000.
-
(2000)
Proceedings of the Fourth International Conference of Autonomous Agents
, pp. 396-403
-
-
Wolpert, D.H.1
Kirshner, S.2
Merz, C.J.3
Tumer, K.4
-
21
-
-
0001309161
-
Optimal payoff functions for members of collectives
-
D. H. Wolpert and K. Tumer. Optimal payoff functions for members of collectives. Advances in Complex Systems, 4(2/3):265-279, 2001.
-
(2001)
Advances in Complex Systems
, vol.4
, Issue.2-3
, pp. 265-279
-
-
Wolpert, D.H.1
Tumer, K.2
-
22
-
-
1842531912
-
Improving search algorithms by using intelligent coordinates
-
D. H. Wolpert, K. Tumer, and E. Bandari. Improving search algorithms by using intelligent coordinates. Physical Review E, 69:017701, 2004.
-
(2004)
Physical Review E
, vol.69
, pp. 017701
-
-
Wolpert, D.H.1
Tumer, K.2
Bandari, E.3
-
23
-
-
0034635650
-
Collective intelligence for control of distributed dynamical systems
-
March
-
D. H. Wolpert, K. Wheeler, and K. Tumer. Collective intelligence for control of distributed dynamical systems. Europhysics Letters, 49(6), March 2000.
-
(2000)
Europhysics Letters
, vol.49
, Issue.6
-
-
Wolpert, D.H.1
Wheeler, K.2
Tumer, K.3
|