-
3
-
-
57849155854
-
Using generalized learning automata for state space aggregation in mas. Lecture Notes in Computer Science
-
YM. De Hauwere, P. Vrancx, and A. Nowé. Using generalized learning automata for state space aggregation in mas. Lecture Notes in Computer Science, Knowledge-Based Intelligent Information and Engineering Systems (KES 2008), 5177:182-193, 2008.
-
(2008)
Knowledge-Based Intelligent Information and Engineering Systems (KES 2008)
, vol.5177
, pp. 182-193
-
-
De Hauwere, Y.M.1
Vrancx, P.2
Nowé, A.3
-
6
-
-
40949099898
-
Utile coordination: Learning interdependencies among cooperative agents
-
J.R. Kok, P.J. 't Hoen, B. Bakker, and N. Vlassis. Utile coordination: Learning interdependencies among cooperative agents. In Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG05), pages 29-36, 2005.
-
(2005)
Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG05)
, pp. 29-36
-
-
Kok, J.R.1
't Hoen, P.J.2
Bakker, B.3
Vlassis, N.4
-
10
-
-
0028555752
-
Learning to coordinate without sharing information
-
S. Sen, I. Sen, M. Sekaran, and J. Hale. Learning to coordinate without sharing information. In Proceedings of the Twelfth National Conference on Artificial Intelligence, pages 426-431, 1994.
-
(1994)
Proceedings of the Twelfth National Conference on Artificial Intelligence
, pp. 426-431
-
-
Sen, S.1
Sen, I.2
Sekaran, M.3
Hale, J.4
-
12
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. cooperative agents
-
Morgan Kaufmann
-
M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning, pages 330-337. Morgan Kaufmann, 1993.
-
(1993)
Proceedings of the Tenth International Conference on Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
13
-
-
84873851838
-
Networks of Learning Automata: Techniques for Online Stochastic Optimization
-
M.A.L. Thathachar and P.S. Sastry. Networks of Learning Automata: Techniques for Online Stochastic Optimization. Kluwer Academic Publishers, 2004.
-
(2004)
Kluwer Academic Publishers
-
-
Thathachar, M.A.L.1
Sastry, P.S.2
-
14
-
-
0028497630
-
Asynchronous stochastic approximation and q-learning
-
J.N. Tsitsiklis. Asynchronous stochastic approximation and q-learning. Journal of Machine Learning, 16(3):185-202, 1994.
-
(1994)
Journal of Machine Learning
, vol.16
, Issue.3
, pp. 185-202
-
-
Tsitsiklis, J.N.1
-
16
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
R.J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Journal of Machine Learning, 8(3):229-256, 1992.
-
(1992)
Journal of Machine Learning
, vol.8
, Issue.3
, pp. 229-256
-
-
Williams, R.J.1
|