1. Aumann, R. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1, 67-96.
8. Hu, J., & Wellman, M. (2003). Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 4, 1039-1069.
10. Kapetanakis, S., Kudenko, D., & Strens, M. (2003). Learning to coordinate using commitment sequences in cooperative multi-agent systems. In Proceedings of the 3rd symposium on adaptive agents and multi-agent systems (AISB03). Society for the Study of Artificial Intelligence and Simulation of Behaviour.
17. Nowé, A., Parent, J., & Verbeeck, K. (2001). Social agents playing a periodical policy. In Proceedings of the 12th European conference on machine learning (pp. 382-393). Freiburg, Germany: Springer-Verlag, LNAI 2168.
20. Sastry, P., Phansalkar, V., & Thathachar, M. (1994). Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information. IEEE Transactions on Systems, Man, and Cybernetics, 24(5), 769-777.
22. Thathachar, M., & Sastry, P. (2002). Varieties of learning automata: An overview. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 32(6), 711-722.
23. Tsitsiklis, J. (1994). Asynchronous stochastic approximation and Q-learning. Machine Learning, 16, 185-202.
26. Verbeeck, K., Nowé, A., & Parent, J. (2002). Homo egualis reinforcement learning agents for load balancing. In Proceedings of the 1st NASA workshop on radical agent concepts (pp. 81-91). Springer-Verlag, LNAI 2564.