Borgers, T., & Sarin, R. (1997). Learning through reinforcement and replicator dynamics. Journal of Economic Theory, 77, 1-14.
Borkar, V. S., & Meyn, S. P. (2000). The O.D.E. method for convergence of stochastic approximation and reinforcement learning. SIAM Journal on Control and Optimization, 38, 447-469.
Galstyan, A., Czajkowski, K., & Lerman, K. (2004). Resource allocation in the grid using reinforcement learning. Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS'04) (pp. 1314-1315). Washington, DC, USA: IEEE Computer Society.
Gomes, E. R., & Kowalczyk, R. (2007). Learning the IPA market with individual and social rewards. Proceedings of the International Conference on Intelligent Agent Technology (IAT'07) (pp. 328-334). Fremont, CA, USA: IEEE Computer Society.
Harandi, M. T., Ahmadabadi, M. N., & Araabi, B. N. (2008). Optimal local basis: A reinforcement learning approach for face recognition. International Journal of Computer Vision, 81, 191-204.
Iglesias, A., Martínez, P., Aler, R., & Fernández, F. (2008). Learning teaching strategies in an adaptive and intelligent educational system through reinforcement learning. Applied Intelligence, in press.
Panait, L., Tuyls, K., & Luke, S. (2008). Theoretical advantages of lenient learners: An evolutionary game theoretic perspective. Journal of Machine Learning Research, 9, 423-457.
Tuyls, K., Verbeeck, K., & Lenaerts, T. (2003). A selection-mutation model for Q-learning in multi-agent systems. Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS'03) (pp. 693-700). New York, NY, USA: ACM.
-
17
-
-
0346502047
-
Predicting the expected behavior of agents that learn about agents: The CLRI framework
-
Vidal, J. M., & Durfee, E. H. (2003). Predicting the expected behavior of agents that learn about agents: the CLRI framework. Autonomous Agents and Multi-Agent Systems, 6, 77-107.
-
(2003)
Autonomous Agents and Multi-Agent Systems
, vol.6
, pp. 77-107
-
-
Vidal, J.M.1
Durfee, E.H.2
-
Ziogos, N. P., Tellidou, A. C., Gountis, V. P., & Bakirtzis, A. G. (2007). A reinforcement learning algorithm for market participants in FTR auctions. Proceedings of the Seventh IEEE Power Tech (pp. 943-948). IEEE.