-
3
-
-
0031281590
-
Learning through reinforcement and replicator dynamics
-
Börgers, T. and R. Sarin: 1997, 'Learning through Reinforcement and Replicator Dynamics', Journal of Economic Theory 77(1).
-
(1997)
Journal of Economic Theory
, vol.77
, Issue.1
-
-
Börgers, T.1
Sarin, R.2
-
4
-
-
34250513249
-
Uber ein paradoxen aus der Verkehrsplanung
-
Braess D.: 1968, 'Uber ein paradoxen aus der Verkehrsplanung', Unternehmensforschung 12, 258.
-
(1968)
Unternehmensforschung
, vol.12
, pp. 258
-
-
Braess, D.1
-
12
-
-
9444236608
-
On no-regret learning, fictitious play, and nash equilibrium
-
Jafari, C., A. Greenwald, D. Gondek, and G. Ercal: 2001, 'On No-Regret Learning, Fictitious Play, and Nash Equilibrium', in Proceedings of the Eighteenth International Conference on Machine Learning, pp. 223-226.
-
(2001)
Proceedings of the Eighteenth International Conference on Machine Learning
, pp. 223-226
-
-
Jafari, C.1
Greenwald, A.2
Gondek, D.3
Ercal, G.4
-
16
-
-
33751180011
-
A roadmap for agent based computing
-
Luck, M., P. McBurney, and C. Preist: 2003, 'A Roadmap for Agent Based Computing', AgentLink, Nehvork of Excellence.
-
(2003)
AgentLink, Nehvork of Excellence
-
-
Luck, M.1
McBurney, P.2
Preist, C.3
-
18
-
-
34548719708
-
The logic of animal conflict
-
Maynard Smith, J. and G. R. Price: 1973, 'The Logic of Animal Conflict', Nature 146, 15-18.
-
(1973)
Nature
, vol.146
, pp. 15-18
-
-
Maynard Smith, J.1
Price, G.R.2
-
20
-
-
84948131383
-
Social agents playing a periodical policy
-
Nowé, A., J. Parent, and K. Verbeeck: 2001, 'Social Agents Playing a Periodical Policy', in Proceedings of the 12th European Conference on Machine Learning, pp. 382-393.
-
(2001)
Proceedings of the 12th European Conference on Machine Learning
, pp. 382-393
-
-
Nowé, A.1
Parent, J.2
Verbeeck, K.3
-
21
-
-
0011847654
-
Distributed reinforcement learning, loadbased routing a case study
-
Stockholm, Sweden
-
Nowé A. and K. Verbeeck: 1999, 'Distributed Reinforcement learning, Loadbased Routing a Case Study', Notes of the Neural, Symbolic and Reinforcement Methods for Sequence Learning Workshop at ijcai99, Stockholm, Sweden.
-
(1999)
Notes of the Neural, Symbolic and Reinforcement Methods for Sequence Learning Workshop at Ijcai99
-
-
Nowé, A.1
Verbeeck, K.2
-
27
-
-
33751186226
-
-
Robocup project
-
Robocup project: 2003, 'The Official Robocup Website at www.robocup.org, Robocup.
-
(2003)
Robocup
-
-
-
30
-
-
1142293590
-
-
Institute for Theoretical Physics, Koln, Euroland
-
Stauffer, D.: 1999, Life, Love and Death: Models of Biological Reproduction and Aging, Institute for Theoretical Physics, Koln, Euroland.
-
(1999)
Life, Love and Death: Models of Biological Reproduction and Aging
-
-
Stauffer, D.1
-
34
-
-
1142305721
-
Towards a relation between learning agents and evolutionary dynamics
-
KU Leuven, Belgium
-
Tuyls, K., T. Lenaerts, K. Verbeeck, S. Maes, and B. Manderick: 2002, 'Towards a Relation between Learning Agents and Evolutionary Dynamics', in Proceedings of the Belgium-Netherlands Artificial Intelligence Conference 2002 (BNAIC), KU Leuven, Belgium.
-
(2002)
Proceedings of the Belgium-Netherlands Artificial Intelligence Conference 2002 (BNAIC)
-
-
Tuyls, K.1
Lenaerts, T.2
Verbeeck, K.3
Maes, S.4
Manderick, B.5
-
35
-
-
8344263004
-
On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor
-
Lecture Notes in Artificial Intelligence, Multi-Agent Systems and Applications III, (Central and Eastern European conference on Multi-Agent Systems 2003), Prague, 16-18 June 2003, Czech Republic
-
Tuyls, K., K. Verbeeck, and S. Maes: 2003a, 'On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor, Lecture Notes in Artificial Intelligence, Multi-Agent Systems and Applications III, Lecture Notes in AI 2691, (Central and Eastern European conference on Multi-Agent Systems 2003), Prague, 16-18 June 2003, Czech Republic.
-
(2003)
Lecture Notes in AI
, vol.2691
-
-
Tuyls, K.1
Verbeeck, K.2
Maes, S.3
-
36
-
-
26444437242
-
A selection-mutation model for Q-learning in multi-agent systems
-
Melbourne, 14-18 July 2003, Australia
-
Tuyls, K., K. Verbeeck, and T. Lenaerts, T.: 2003b, 'A Selection-Mutation Model for Q-Learning in Multi-Agent Systems', in The ACM International Conference Proceedings Series, Autonomous Agents and Multi-Agent Systems 2003, Melbourne, 14-18 July 2003, Australia.
-
(2003)
The ACM International Conference Proceedings Series, Autonomous Agents and Multi-agent Systems 2003
-
-
Tuyls, K.1
Verbeeck, K.2
Lenaerts, T.3
-
37
-
-
9444229990
-
Extended replicator dynamics as a key to reinforcement learning in multi-agent systems
-
Cavtat-Dubrovnik, 22-26 September 2003, Croatia
-
Tuyls, K., D. Heytens, A. Nowe, and B. Manderick: 2003c, 'Extended Replicator Dynamics as a Key to Reinforcement Learning in Multi-Agent Systems', Proceedings of the European Conference on Machine Learning'03, Lecture Notes in Artificial Intelligence, Cavtat-Dubrovnik, 22-26 September 2003, Croatia.
-
(2003)
Proceedings of the European Conference on Machine Learning'03, Lecture Notes in Artificial Intelligence
-
-
Tuyls, K.1
Heytens, D.2
Nowe, A.3
Manderick, B.4
-
40
-
-
0003744207
-
-
Gerard Weiss (ed.), MIT Press, Cambridge, MA
-
Weiss, G.: 1999, in Gerard Weiss (ed.), Multiagent Systems. A Modem Approach to Distributed Artificial Intelligence, MIT Press, Cambridge, MA.
-
(1999)
Multiagent Systems. A Modem Approach to Distributed Artificial Intelligence
-
-
Weiss, G.1
|