-
1
-
-
84899963942
-
Social reward shaping in the prisoner's dilemma
-
Babes, M., de Cote, E. and Littman, M., Social reward shaping in the prisoner's dilemma, in Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 3 (2008), pp. 1389-1392.
-
(2008)
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems
, vol.3
, pp. 1389-1392
-
-
Babes, M.1
De Cote, E.2
Littman, M.3
-
5
-
-
40949147745
-
A comprehensive survey of multi-agent reinforcement learning
-
Busoniu, L., Babuska, R. and De Schutter, B., A comprehensive survey of multi-agent reinforcement learning, IEEE Trans. Syst. Man Cyb. C. 38 (2008) 156.
-
(2008)
IEEE Trans. Syst. Man Cyb. C.
, vol.38
, pp. 156
-
-
Busoniu, L.1
Babuska, R.2
De Schutter, B.3
-
7
-
-
84856908031
-
Reinforcement learning in robocup keepaway with partial observability
-
Devlin, S., Grzés, M. and Kudenko, D., Reinforcement learning in robocup keepaway with partial observability, in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2009. WI-IAT'09 (2009).
-
(2009)
IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2009. WI-IAT'09
-
-
Devlin, S.1
Grzés, M.2
Kudenko, D.3
-
10
-
-
0004260007
-
-
MIT Press, Cambridge, MA
-
Fudenberg, D. and Tirole, J., Game Theory (MIT Press, Cambridge, MA, 1991).
-
(1991)
Game Theory
-
-
Fudenberg, D.1
Tirole, J.2
-
13
-
-
4644369748
-
Nash Q-learning for general-sum stochastic games
-
Hu, J. andWellman, M., Nash Q-learning for general-sum stochastic games, J. Mach. Learn. Res. 4 (2003) 1039-1069.
-
(2003)
J. Mach. Learn. Res.
, vol.4
, pp. 1039-1069
-
-
Hu, J.1
Wellman, M.2
-
14
-
-
84899897564
-
A new perspective to the keepaway soccer: The takers
-
Iscen, A. and Erogul, U., A new perspective to the keepaway soccer: the takers, in Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 3 (2008), pp. 1341-1344.
-
(2008)
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems
, vol.3
, pp. 1341-1344
-
-
Iscen, A.1
Erogul, U.2
-
15
-
-
77950988223
-
Learning complementary multiagent behaviors: A case study
-
RoboCup 2009: Robot Soccer World Cup XIII, eds. Baltes, J., Lagoudakis, M., Naruse, T. and Ghidary, S., Springer Berlin/Heidelberg
-
Kalyanakrishnan, S. and Stone, P., Learning complementary multiagent behaviors: A case study, in RoboCup 2009: Robot Soccer World Cup XIII, eds. Baltes, J., Lagoudakis, M., Naruse, T. and Ghidary, S., Lecture Notes in Computer Science, Vol. 5949 (Springer Berlin/Heidelberg, 2010), pp. 153-165.
-
(2010)
Lecture Notes in Computer Science
, vol.5949
, pp. 153-165
-
-
Kalyanakrishnan, S.1
Stone, P.2
-
16
-
-
0029732210
-
Creating advice-taking reinforcement learners
-
Maclin, R. and Shavlik, J., Creating advice-taking reinforcement learners, Lect. Notes Artif. Int. (1996) 251-281. (Pubitemid 126724368)
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 251-281
-
-
Maclin, R.1
Shavlik, J.W.2
-
18
-
-
77950915046
-
Decentralized learning in wireless sensor networks
-
Mihaylov, M., Tuyls, K. and Noẃe, A., Decentralized learning in wireless sensor networks, Adaptive and Learning Agents (2009), pp. 60-73.
-
(2009)
Adaptive and Learning Agents
, pp. 60-73
-
-
Mihaylov, M.1
Tuyls, K.2
Noẃe, A.3
-
19
-
-
62949148941
-
A study of reinforcement learning in a new multiagent domain
-
Min, H., Zeng, J., Chen, J. and Zhu, J., A Study of reinforcement learning in a new multiagent domain, in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT'08, Vol. 2 (2008).
-
(2008)
IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT'08
, vol.2
-
-
Min, H.1
Zeng, J.2
Chen, J.3
Zhu, J.4
-
20
-
-
0001730497
-
Non-cooperative games
-
Nash, J., Non-cooperative games, Ann. Math. 54 (1951) 286-295.
-
(1951)
Ann. Math.
, vol.54
, pp. 286-295
-
-
Nash, J.1
-
21
-
-
0141596576
-
Policy invariance under reward transformations: Theory and application to reward shaping
-
Ng, A. Y., Harada, D. and Russell, S. J., Policy invariance under reward transformations: Theory and application to reward shaping, in Proceedings of the 16th International Conference on Machine Learning (1999), pp. 278-287.
-
(1999)
Proceedings of the 16th International Conference on Machine Learning
, pp. 278-287
-
-
Ng, A.Y.1
Harada, D.2
Russell, S.J.3
-
22
-
-
34447553096
-
Reinforcement learning for humanoid robotics
-
Peters, J., Vijayakumar, S. and Schaal, S., Reinforcement learning for humanoid robotics, in Proceedings of Humanoids2003, Third IEEE-RAS International Conference on Humanoid Robots (2003).
-
(2003)
Proceedings of Humanoids2003, Third IEEE-RAS International Conference on Humanoid Robots
-
-
Peters, J.1
Vijayakumar, S.2
Schaal, S.3
-
23
-
-
0003998452
-
-
John Wiley & Sons, Inc., New York, NY, USA
-
Puterman, M. L., Markov Decision Processes: Discrete Stochastic Dynamic Programming (John Wiley & Sons, Inc., New York, NY, USA, 1994).
-
(1994)
Markov Decision Processes: Discrete Stochastic Dynamic Programming
-
-
Puterman, M.L.1
-
25
-
-
34147161536
-
If multi-agent learning is the answer, what is the question?
-
DOI 10.1016/j.artint.2006.02.006, PII S0004370207000495, Foundations of Multi-Agent Learning
-
Shoham, Y., Powers, R. and Grenager, T., If multi-agent learning is the answer, what is the question? Artif. Intell. 171 (2007) 365-377. (Pubitemid 46802421)
-
(2007)
Artificial Intelligence
, vol.171
, Issue.7
, pp. 365-377
-
-
Shoham, Y.1
Powers, R.2
Grenager, T.3
-
26
-
-
37249034293
-
Keepaway soccer: From machine learning testbed to benchmark
-
RoboCup 2005: Robot Soccer World Cup IX
-
Stone, P., Kuhlmann, G., Taylor, M. E. and Liu, Y., Keepaway soccer: From machine learning testbed to benchmark, in RoboCup-2005: Robot Soccer World Cup IX, eds. Noda, I., Jacoff, A., Bredenfeld, A. and Takahashi, Y., Vol. 4020 (Springer-Verlag, Berlin, 2006), pp. 93-105. (Pubitemid 350278772)
-
(2006)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.4020 LNAI
, pp. 93-105
-
-
Stone, P.1
Kuhlmann, G.2
Taylor, M.E.3
Liu, Y.4
-
27
-
-
27544506565
-
Reinforcement learning for RoboCupsoccer keepaway
-
Stone, P., Sutton, R. S. and Kuhlmann, G., Reinforcement learning for RoboCupsoccer keepaway, Adapt. Behav. 13 (2005) 165-188.
-
(2005)
Adapt. Behav.
, vol.13
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
28
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
Sutton, R., Generalization in reinforcement learning: Successful examples using sparse coarse coding, Adv. Neur. In. (1996) 1038-1044.
-
(1996)
Adv. Neur. In.
, pp. 1038-1044
-
-
Sutton, R.1
-
29
-
-
0003617454
-
-
Ph.D. Thesis, Department of Computer Science, University of Massachusetts, Amherst
-
Sutton, R. S., Temporal Credit Assignment in Reinforcement Learning, Ph.D. Thesis, Department of Computer Science, University of Massachusetts, Amherst (1984).
-
(1984)
Temporal Credit Assignment in Reinforcement Learning
-
-
Sutton, R.S.1
-
32
-
-
70349592320
-
Learning from actions not taken in multiagent systems
-
Tumer, K. and Khani, N., Learning from actions not taken in multiagent systems, Adv. Complex Syst. 12 (2009) 455-473.
-
(2009)
Adv. Complex Syst.
, vol.12
, pp. 455-473
-
-
Tumer, K.1
Khani, N.2
-
34
-
-
27744448185
-
Reinforcement learning to play an optimal Nash equilibrium in team Markov games
-
Wang, X. and Sandholm, T., Reinforcement learning to play an optimal Nash equilibrium in team Markov games, Adv. Neur. In. (2003) 1603-1610.
-
(2003)
Adv. Neur. In.
, pp. 1603-1610
-
-
Wang, X.1
Sandholm, T.2
-
35
-
-
27344453198
-
Potential-based shaping and Q-value initialization are equivalent
-
Wiewiora, E., Potential-based shaping and Q-value initialization are equivalent, J. Artif. Intell. Res. 19 (2003) 205-208. (Pubitemid 41525920)
-
(2003)
Journal of Artificial Intelligence Research
, vol.19
, pp. 205-208
-
-
Wiewiora, E.1
-
36
-
-
0004320981
-
An introduction to collective intelligence
-
NASA Ames Research Center
-
Wolpert, D. and Tumer, K., An introduction to collective intelligence, Technical Report cs.LG/9908014, NASA Ames Research Center (1999).
-
(1999)
Technical Report cs.LG/9908014
-
-
Wolpert, D.1
Tumer, K.2
|