-
2
-
-
0141988716
-
Recent advances in hierarchical reinforcement learning
-
A. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete-Event Syst. J. vol. 13, pp. 41-77, 2003.
-
(2003)
Discrete-event Syst. J.
, vol.13
, pp. 41-77
-
-
Barto, A.1
Mahadevan, S.2
-
4
-
-
1142293055
-
Transition independent decentralized Markov decision problem
-
R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman, "Transition independent decentralized Markov decision problem," in Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2003.
-
(2003)
Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
5
-
-
0031281590
-
Reinforcement and replicator dynamics
-
T. Börgers and R. Sarin, "Reinforcement and replicator dynamics," J. Econ. Theory, vol. 77, no.1, pp. 1-14, 1997.
-
(1997)
J. Econ. Theory
, vol.77
, Issue.1
, pp. 1-14
-
-
Börgers, T.1
Sarin, R.2
-
7
-
-
0001491619
-
A mathematical model for simple learning
-
R. R. Bush and F. Mosteller, "A Mathematical Model for Simple Learning," The Psychol. Rev. vol. 58, pp. 15-18, 1951.
-
(1951)
The Psychol. Rev.
, vol.58
, pp. 15-18
-
-
Bush, R.R.1
Mosteller, F.2
-
10
-
-
0000742255
-
A stochastic learning model of economic behaviour
-
J. G. Cross, "A stochastic learning model of economic behaviour," Quart. J. Econ., vol. 87, no.5, pp. 239-266, 1973.
-
(1973)
Quart. J. Econ.
, vol.87
, Issue.5
, pp. 239-266
-
-
Cross, J.G.1
-
13
-
-
84880803349
-
Generalizing plans to new environments in relational MDPs
-
C. Guestrin, D. Koller, C. Gearhart, and N. Kanodia, "Generalizing plans to new environments in relational MDPs," in International Joint Conference on Artificial Intelligence (IJCAI-03), 2003.
-
(2003)
International Joint Conference on Artificial Intelligence (IJCAI-03)
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
-
18
-
-
9444236608
-
On no-regret learning, fictitious play, and Nash equilibrium
-
Cambridge University Press
-
C. Jafari, A. Greenwald, D. Gondek, and G. Ercal, "On no-regret learning, fictitious play, and Nash equilibrium," in Proceedings of the Eighteenth International Conference on Machine Learning (ICML), Cambridge University Press, pp. 223-226, 2001.
-
(2001)
Proceedings of the Eighteenth International Conference on Machine Learning (ICML)
, pp. 223-226
-
-
Jafari, C.1
Greenwald, A.2
Gondek, D.3
Ercal, G.4
-
20
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement Learning: A Survey," J. Artif. Intell. Res. vol. 4, pp. 237-285, 1996.
-
(1996)
J. Artif. Intell. Res.
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
21
-
-
0012286079
-
An algorithm for distributed reinforcement learning in cooperative multi-agent systems
-
Morgan Kaufmann: San Francisco, CA
-
M. Lauer and M. Riedmiller, "An algorithm for distributed reinforcement learning in cooperative multi-agent systems," in Proc. 17th International Conf. on Machine Learning Morgan Kaufmann: San Francisco, CA, pp. 535-542, 2000.
-
(2000)
Proc. 17th International Conf. on Machine Learning
, pp. 535-542
-
-
Lauer, M.1
Riedmiller, M.2
-
24
-
-
34548719708
-
The logic of animal conflict
-
J. Maynard Smith and G. R. Price, "The logic of animal conflict," Nature, vol. 146, no. 2, pp. 15-18, 1973.
-
(1973)
Nature
, vol.146
, Issue.2
, pp. 15-18
-
-
Smith, J.M.1
Price, G.R.2
-
25
-
-
0016082525
-
Learning automata: A survey
-
K. Narendra and M. Thathachar, "Learning automata: A survey," IEEE Trans. Syst. Man Cybernet, vol. 14, no.5, pp. 323-334, 1974.
-
(1974)
IEEE Trans. Syst. Man Cybernet
, vol.14
, Issue.5
, pp. 323-334
-
-
Narendra, K.1
Thathachar, M.2
-
27
-
-
84948131383
-
Social agents playing a periodical policy
-
Proceedings of the 12th European Conference on Machine Learning, Springer
-
A. Nowé, J. Parent, and K. Verbeeck, "Social agents playing a periodical policy," in Proceedings of the 12th European Conference on Machine Learning, Volume 2176 of Lecture Notes in Artificial Intelligence, Springer, pp. 382-393, 2001.
-
(2001)
Lecture Notes in Artificial Intelligence
, vol.2176
, pp. 382-393
-
-
Nowé, A.1
Parent, J.2
Verbeeck, K.3
-
28
-
-
4544335718
-
Run the GAMUT: A comprehensive approach to evaluating game-theoretic algorithms
-
E. Nudelman, J. Wortman, K. Leyton-Brown, and Y. Shoham, "Run the GAMUT: A comprehensive approach to evaluating game-theoretic algorithms," in Proceedings of the Fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2004.
-
(2004)
Proceedings of the Fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
-
-
Nudelman, E.1
Wortman, J.2
Leyton-Brown, K.3
Shoham, Y.4
-
30
-
-
3142772701
-
Adaptive load balancing of parallel applications with social reinforcement learning on heterogeneous systems
-
to appear
-
J. Parent, K. Verbeeck, A. Nowé, K. Steenhaut, J. Lemeire, and E. Dirkx, "Adaptive load balancing of parallel applications with social reinforcement learning on heterogeneous systems," J. Sci. Program., 2004, to appear.
-
(2004)
J. Sci. Program.
-
-
Parent, J.1
Verbeeck, K.2
Nowé, A.3
Steenhaut, K.4
Lemeire, J.5
Dirkx, E.6
-
31
-
-
31344476554
-
An evolutionary game-theoretic comparison of two double-auction market designs
-
Workshop on Agent Mediated Electronic Commerce VI: Theories for Engineering of Distributed Mechanisms and Systems (AMEC'04), Springer
-
S. Phelps, S. Parsons, and P. McBurney, "An evolutionary game-theoretic comparison of two double-auction market designs," in Workshop on Agent Mediated Electronic Commerce VI: Theories for Engineering of Distributed Mechanisms and Systems (AMEC'04), Volume 2531 of Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2004.
-
(2004)
Lecture Notes in Artificial Intelligence
, vol.2531
, pp. 109-118
-
-
Phelps, S.1
Parsons, S.2
McBurney, P.3
-
35
-
-
0034661690
-
Evolution of biological information
-
T. D. Schneider, "Evolution of biological information," J. Nucl. Acid Res. vol. 28, no. 14, pp. 2794-2799, 2000.
-
(2000)
J. Nucl. Acid Res.
, vol.28
, Issue.14
, pp. 2794-2799
-
-
Schneider, T.D.1
-
38
-
-
22944450534
-
Collective INtelligence with sequence of actions
-
14th European Conference on Machine Learning, Springer
-
P. J. 't Hoen and S. M. Bohte, "Collective INtelligence with sequence of actions," in 14th European Conference on Machine Learning, Volume 2837 of Lecture Notes in Artificial Intelligence, Springer, 2003.
-
(2003)
Lecture Notes in Artificial Intelligence
, vol.2837
-
-
'T Hoen, P.J.1
Bohte, S.M.2
-
42
-
-
0036894214
-
Varieties of learning automata: An overview
-
P. S. Sastry and M. A. L. Thathachar, "Varieties of Learning Automata: An Overview," IEEE Trans. Sys. Man Cybernet, vol. 32, no. 6, pp. 323-334, 2002.
-
(2002)
IEEE Trans. Sys. Man Cybernet
, vol.32
, Issue.6
, pp. 323-334
-
-
Sastry, P.S.1
Thathachar, M.A.L.2
-
43
-
-
0028497630
-
Asynchronous stochastic approximation and Q-learning
-
J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learn, vol. 16, pp. 185-202, 1994.
-
(1994)
Machine Learn
, vol.16
, pp. 185-202
-
-
Tsitsiklis, J.N.1
-
44
-
-
85158118268
-
Collective INtelligence and Braess' paradox
-
Austin, August
-
K. Tumer and D. Wolpert, "Collective INtelligence and Braess' Paradox," in Proceedings of the Sixteenth National Conference on Artificial Intelligence, Austin, pp. 104-109, August, 2000.
-
(2000)
Proceedings of the Sixteenth National Conference on Artificial Intelligence
, pp. 104-109
-
-
Tumer, K.1
Wolpert, D.2
-
45
-
-
4344576577
-
-
Ph.D. dissertation, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium
-
K. Tuyls, Learning in Multi-Agent Systems, An Evolutionary Game Theoretic Approach, Ph.D. dissertation, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium, 2004.
-
(2004)
Learning in Multi-agent Systems, an Evolutionary Game Theoretic Approach
-
-
Tuyls, K.1
-
46
-
-
9444229990
-
Extended replicator dynamics as a key to reinforcement learning in multi-agent systems
-
Proceedings of the 14th European Conference on Machine Learning (ECML), Springer
-
K. Tuyls, D. Heytens, A. Nowé, and B. Manderick, "Extended Replicator Dynamics as a Key to Reinforcement Learning in Multi-Agent Systems," in Proceedings of the 14th European Conference on Machine Learning (ECML), Volume 2837 of Lecture Notes in Artificial Intelligence, Springer, 2003.
-
(2003)
Lecture Notes in Artificial Intelligence
, vol.2837
-
-
Tuyls, K.1
Heytens, D.2
Nowé, A.3
Manderick, B.4
-
47
-
-
1142305721
-
Towards a relation between learning agents and evolutionary dynamics
-
Cambridge University Press
-
K. Tuyls, T. Lenaerts, K. Verbeeck, S. Maes, and B. Manderick, "Towards a relation between learning agents and evolutionary dynamics," in Proceedings of the Belgian-Dutch Conference on Artificial Intelligence (BNAIC 2002), Cambridge University Press, pp. 223-226, 2002.
-
(2002)
Proceedings of the Belgian-Dutch Conference on Artificial Intelligence (BNAIC 2002)
, pp. 223-226
-
-
Tuyls, K.1
Lenaerts, T.2
Verbeeck, K.3
Maes, S.4
Manderick, B.5
-
48
-
-
31344438454
-
An evolutionary game theoretic perspective on learning in multi-agent systems
-
Kluwer Academic Publishers
-
K. Tuyls, A. Nowé, T. Lenaerts, and B. Manderick, "An evolutionary game theoretic perspective on learning in multi-agent systems," in Synthese, Section Knowledge, Rationality and Action, Kluwer Academic Publishers, vol. 139, no. 2, pp. 297-330, 2004.
-
(2004)
Synthese, Section Knowledge, Rationality and Action
, vol.139
, Issue.2
, pp. 297-330
-
-
Tuyls, K.1
Nowé, A.2
Lenaerts, T.3
Manderick, B.4
-
50
-
-
31344463262
-
Homo egualis reinforcement learning agents for load balancing
-
Proceedings of the 1st NASA Workshop on Radical Agent Concepts, Springer
-
K. Verbeeck, A. Nowé, and J. Parent, "Homo egualis reinforcement learning agents for load balancing," in Proceedings of the 1st NASA Workshop on Radical Agent Concepts, Volume 2564 of Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2002.
-
(2002)
Lecture Notes in Artificial Intelligence
, vol.2564
, pp. 109-118
-
-
Verbeeck, K.1
Nowé, A.2
Parent, J.3
-
52
-
-
1842545000
-
Analyzing complex strategic interactions in multi-agent games
-
Springer
-
W. E. Walsh, R. Das, G. Tesauro, and J. O. Kephart, "Analyzing complex strategic interactions in multi-agent games," in Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-02) Workshop on Game Theoretic and Decision Theoretic Agents, Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2002.
-
(2002)
Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-02) Workshop on Game Theoretic and Decision Theoretic Agents, Lecture Notes in Artificial Intelligence
, pp. 109-118
-
-
Walsh, W.E.1
Das, R.2
Tesauro, G.3
Kephart, J.O.4
-
55
-
-
84899033169
-
Using collective INtelligence to route internet traffic
-
Denver
-
D. H. Wolpert, K. Tumer, and J. Frank, "Using Collective INtelligence to route internet traffic," in Advances in Neural Information Processing Systems-II, Denver, pp. 952-958, 1998.
-
(1998)
Advances in Neural Information Processing Systems-II
, pp. 952-958
-
-
Wolpert, D.H.1
Tumer, K.2
Frank, J.3
-
56
-
-
0032691530
-
General principles of learning-based multi-agent systems
-
Oren Etzioni, Jörg P. Müller, and Jeffrey M. Bradshaw (eds.), ACM Press: Seattle, WA, USA
-
D. H. Wolpert, K. R. Wheeler, and K. Tumer, "General principles of learning-based multi-agent systems," in Oren Etzioni, Jörg P. Müller, and Jeffrey M. Bradshaw (eds.), Proceedings of the Third International Conference on Autonomous Agents (Agents'99), ACM Press: Seattle, WA, USA, pp. 77-83, 1999.
-
(1999)
Proceedings of the Third International Conference on Autonomous Agents (Agents'99)
, pp. 77-83
-
-
Wolpert, D.H.1
Wheeler, K.R.2
Tumer, K.3