-
1
-
-
52249098423
-
Optimal and approximate Q-value functions for decentralized POMDPs
-
F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32:289-353, 2008.
-
(2008)
JAIR
, vol.32
, pp. 289-353
-
-
Oliehoek, F.A.1
Spaan, M.T.J.2
Vlassis, N.3
-
2
-
-
84962082047
-
Multi-agent reinforcement learning as a rehearsal for decentralized planning
-
L. Kraemer and B. Banerjee. Multi-agent reinforcement learning as a rehearsal for decentralized planning. Neurocomputing, 190:82-94, 2016.
-
(2016)
Neurocomputing
, vol.190
, pp. 82-94
-
-
Kraemer, L.1
Banerjee, B.2
-
3
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
4
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. Cooperative agents
-
M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In ICML, 1993.
-
(1993)
ICML
-
-
Tan, M.1
-
5
-
-
84868340899
-
QueryPOMDP: POMDP-based communication in multiagent systems
-
F. S. Melo, M. Spaan, and S. J. Witwicki. QueryPOMDP: POMDP-based communication in multiagent systems. In Multi-Agent Systems, pages 189-204. 2011.
-
(2011)
Multi-Agent Systems
, pp. 189-204
-
-
Melo, F.S.1
Spaan, M.2
Witwicki, S.J.3
-
6
-
-
26444601262
-
Cooperative multi-agent learning: The state of the art
-
L. Panait and S. Luke. Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems, 11(3):387-434, 2005.
-
(2005)
Autonomous Agents and Multi-Agent Systems
, vol.11
, Issue.3
, pp. 387-434
-
-
Panait, L.1
Luke, S.2
-
12
-
-
79961219393
-
Discovering binary codes for documents by learning deep generative models
-
G. Hinton and R. Salakhutdinov. Discovering binary codes for documents by learning deep generative models. Topics in Cognitive Science, 3(1):74-91, 2011.
-
(2011)
Topics in Cognitive Science
, vol.3
, Issue.1
, pp. 74-91
-
-
Hinton, G.1
Salakhutdinov, R.2
-
14
-
-
84998600495
-
-
arXiv preprint arXiv:1511.08779
-
A. Tampuu, T. Matiisen, D. Kodelja, I. Kuzovkin, K. Korjus, J. Aru, J. Aru, and R. Vicente. Multiagent cooperation and competition with deep reinforcement learning. arXiv preprint arXiv:1511.08779, 2015.
-
(2015)
Multiagent Cooperation and Competition with Deep Reinforcement Learning
-
-
Tampuu, A.1
Matiisen, T.2
Kodelja, D.3
Kuzovkin, I.4
Korjus, K.5
Aru, J.6
Aru, J.7
Vicente, R.8
-
19
-
-
84893343292
-
Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
-
T. Tieleman and G. Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4(2), 2012.
-
(2012)
COURSERA: Neural Networks for Machine Learning
, vol.4
, Issue.2
-
-
Tieleman, T.1
Hinton, G.2
-
23
-
-
84969584486
-
Batch normalization: Accelerating deep network training by reducing internal covariate shift
-
S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, pages 448-456, 2015.
-
(2015)
ICML
, pp. 448-456
-
-
Ioffe, S.1
Szegedy, C.2
-
25
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
26
-
-
33746382692
-
How did language go discrete?
-
M. Tallerman, editor chapter 3. Oxford University Press
-
M. Studdert-Kennedy. How did language go discrete? In M. Tallerman, editor, Language Origins: Perspectives on Evolution, chapter 3. Oxford University Press, 2005.
-
(2005)
Language Origins: Perspectives on Evolution
-
-
Studdert-Kennedy, M.1
|