-
2
-
-
84899963942
-
Social reward shaping in the prisoner's dilemma
-
M. Babes, E. de Cote, and M. Littman. Social reward shaping in the prisoner's dilemma. In Proceedings of The Seventh Annual International Conference on Autonomous Agents and Multiagent Systems, Volume 3, pages 1389-1392, 2008.
-
(2008)
Proceedings of the Seventh Annual International Conference on Autonomous Agents and Multiagent Systems
, vol.3
, pp. 1389-1392
-
-
Babes, M.1
De Cote, E.2
Littman, M.3
-
5
-
-
40949147745
-
A comprehensive survey of MultiAgent reinforcement learning
-
L. Busoniu, R. Babuska, and B. De Schutter. A Comprehensive Survey of MultiAgent Reinforcement Learning. IEEE Transactions on Systems Man & Cybernetics Part C Applications and Reviews, 38(2):156, 2008.
-
(2008)
IEEE Transactions on Systems Man & Cybernetics Part C Applications and Reviews
, vol.38
, Issue.2
, pp. 156
-
-
Busoniu, L.1
Babuska, R.2
De Schutter, B.3
-
7
-
-
79955403826
-
An empirical study of potential-based reward shaping and advice in complex, multi-agent systems
-
S. Devlin, M. Grzes, and D. Kudenko. An empirical study of potential-based reward shaping and advice in complex, multi-agent systems. Advances in Complex Systems, 2011.
-
(2011)
Advances in Complex Systems
-
-
Devlin, S.1
Grzes, M.2
Kudenko, D.3
-
11
-
-
77950298151
-
Online learning of shaping rewards in reinforcement learning
-
M. Grzes and D. Kudenko. Online learning of shaping rewards in reinforcement learning. Artificial Neural Networks-ICANN 2010, pages 541-550, 2010.
-
(2010)
Artificial Neural Networks-ICANN 2010
, pp. 541-550
-
-
Grzes, M.1
Kudenko, D.2
-
14
-
-
0030647149
-
Reinforcement learning in the multi-robot domain
-
M. Mataric. Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1):73-83, 1997.
-
(1997)
Autonomous Robots
, vol.4
, Issue.1
, pp. 73-83
-
-
Mataric, M.1
-
16
-
-
0003998452
-
-
John Wiley & Sons, Inc., New York, NY, USA
-
M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc., New York, NY, USA, 1994.
-
(1994)
Markov Decision Processes: Discrete Stochastic Dynamic Programming
-
-
Puterman, M.L.1
-
18
-
-
34147161536
-
If multi-agent learning is the answer, what is the question?
-
Y. Shoham, R. Powers, and T. Grenager. If multi-agent learning is the answer, what is the question? Artificial Intelligence, 171(7):365-377, 2007.
-
(2007)
Artificial Intelligence
, vol.171
, Issue.7
, pp. 365-377
-
-
Shoham, Y.1
Powers, R.2
Grenager, T.3
-
23
-
-
27344453198
-
Potential-based shaping and Q-value initialization are equivalent
-
E. Wiewiora. Potential-based shaping and Q-value initialization are equivalent. Journal of Artificial Intelligence Research, 19(1):205-208, 2003.
-
(2003)
Journal of Artificial Intelligence Research
, vol.19
, Issue.1
, pp. 205-208
-
-
Wiewiora, E.1
|