-
1
-
-
0037288370
-
Recent advances in hierarchical reinforcement learning
-
to appear
-
A. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete-Event Systems journal, 2003. to appear.
-
(2003)
Discrete-event Systems Journal
-
-
Barto, A.1
Mahadevan, S.2
-
3
-
-
0014413249
-
The tragedy of the commons
-
G. Hardin. The tragedy of the commons. Science, 162:1243-1248, 1968.
-
(1968)
Science
, vol.162
, pp. 1243-1248
-
-
Hardin, G.1
-
4
-
-
0012286079
-
An algorithm for distributed reinforcement learning in cooperative multi-agent systems
-
Morgan Kaufmann, San Francisco, CA
-
M. Lauer and M. Riedmiller. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proc. 17th International Conf. on Machine Learning, pages 535-542. Morgan Kaufmann, San Francisco, CA, 2000.
-
(2000)
Proc. 17th International Conf. on Machine Learning
, pp. 535-542
-
-
Lauer, M.1
Riedmiller, M.2
-
6
-
-
9444230770
-
-
Personal communication with A. Agogino
-
Personal communication with A. Agogino.
-
-
-
-
7
-
-
84945250000
-
Q-cut - Dynamic discovery of sub-goals in reinforcement learning
-
T. Elomaa, H. Mannila, and H. Toivonen, editors. Springer
-
shai Menache, S. Mannor, and N. Shimkin. Q-cut - dynamic discovery of sub-goals in Reinforcement Learning. In T. Elomaa, H. Mannila, and H. Toivonen, editors, Machine Learning: ECML 2002, 13th European Conference on Machine Learning, volume 2430 of LectureNotes in Computer Science, pages 295-306. Springer, 2002.
-
(2002)
Machine Learning: ECML 2002, 13th European Conference on Machine Learning, Volume 2430 of LectureNotes in Computer Science
, vol.2430
, pp. 295-306
-
-
Menache, S.1
Mannor, S.2
Shimkin, N.3
-
9
-
-
0003411271
-
Efficient exploration in reinforcement learning
-
Carnegie Mellon University, Pittsburgh, Pennsylvania
-
S. B. Thrun. Efficient exploration in reinforcement learning. Technical Report CMU-CS-92-102, Carnegie Mellon University, Pittsburgh, Pennsylvania, 1992.
-
(1992)
Technical Report
, vol.CMU-CS-92-102
-
-
Thrun, S.B.1
-
10
-
-
0036355687
-
Learning sequences of actions in collectives of autonomous agents
-
ACM press
-
K. Turner, A. Agogino, and D. Wolpert. Learning sequences of actions in collectives of autonomous agents. In Autonomous Agents & Multiagent Systems, pages 378-385, part 1. ACM press, 2002.
-
(2002)
Autonomous Agents & Multiagent Systems
, Issue.PART 1
, pp. 378-385
-
-
Turner, K.1
Agogino, A.2
Wolpert, D.3
-
12
-
-
34249833101
-
Q-learning
-
Watkins and Dayan. Q-learning. Machine Learning, 8:279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins1
Dayan2
-
13
-
-
9444288363
-
A multiagent framework for planning, reacting, and learning
-
Institut für Informatik, Technische Universität München
-
G. Weiss. A multiagent framework for planning, reacting, and learning. Technical Report FKI-233-99, Institut für Informatik, Technische Universität München, 1999.
-
(1999)
Technical Report
, vol.FKI-233-99
-
-
Weiss, G.1
-
14
-
-
9444234731
-
The economic approach to artificial intelligence
-
M. P. Wellman. The economic approach to artificial intelligence. ACM Computing Surveys, 28(4es):14-15, 1996.
-
(1996)
ACM Computing Surveys
, vol.28
, Issue.4 ES
, pp. 14-15
-
-
Wellman, M.P.1
-
15
-
-
0013250428
-
Market-oriented programming: Some early lessons
-
S. Clearwater, editor, World Scientific, River Edge, New Jersey
-
M. P. Wellman. Market-oriented programming: Some early lessons. In S. Clearwater, editor, Market-Based Control: A Paradigm for Distributed Resource Allocation. World Scientific, River Edge, New Jersey, 1996.
-
(1996)
Market-based Control: A Paradigm for Distributed Resource Allocation
-
-
Wellman, M.P.1
-
17
-
-
0004320981
-
An introduction to COllective INtelligence
-
NASA Ames Research Center
-
D. Wolpert and K. Tumer. An introduction to COllective INtelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center, 1999. A shorter version of this paper is to appear in: Jeffrey M. Bradshaw, editor, Handbook of Agent Technology, AAAI Press/MIT Press, 1999.
-
(1999)
Technical Report
, vol.NASA-ARC-IC-99-63
-
-
Wolpert, D.1
Tumer, K.2
-
18
-
-
0347885021
-
-
AAAI Press/MIT Press
-
D. Wolpert and K. Tumer. An introduction to COllective INtelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center, 1999. A shorter version of this paper is to appear in: Jeffrey M. Bradshaw, editor, Handbook of Agent Technology, AAAI Press/MIT Press, 1999.
-
(1999)
Handbook of Agent Technology
-
-
Bradshaw, J.M.1
-
19
-
-
0001309161
-
Optimal payoff functions for members of collectives
-
in press
-
D. Wolpert and K. Turner. Optimal payoff functions for members of collectives. Advances in Complex Systems, 2001. in press.
-
(2001)
Advances in Complex Systems
-
-
Wolpert, D.1
Turner, K.2
-
20
-
-
84899033169
-
Using collective intelligence to route internet traffic
-
Denver, Dec.
-
D. H. Wolpert, K. Turner, and J. Frank. Using collective intelligence to route internet traffic. In Advances in Neural Information Processing Systems-11, pages 952-958, Denver, Dec. 1998.
-
(1998)
Advances in Neural Information Processing Systems-11
, pp. 952-958
-
-
Wolpert, D.H.1
Turner, K.2
Frank, J.3
-
22
-
-
0032691530
-
General principles of learning-based multi-agent systems
-
O. Etzioni, J. P. Müller, and J. M. Bradshaw, editors, New York, May 1-5. ACM Press
-
D. H. Wolpert, K. R. Wheeler, and K. Tumer. General principles of learning-based multi-agent systems. In O. Etzioni, J. P. Müller, and J. M. Bradshaw, editors, Proceedings of the Third Annual Conference on Autonomous Agents (AGENTS-99), pages 77-83, New York, May 1-5 1999. ACM Press.
-
(1999)
Proceedings of the Third Annual Conference on Autonomous Agents (AGENTS-99)
, pp. 77-83
-
-
Wolpert, D.H.1
Wheeler, K.R.2
Tumer, K.3
|