-
4
-
-
0031189914
-
Multitask learning
-
July
-
Rich Caruana. Multitask learning. Mach. Learn., 28(1):41–75, July 1997.
-
(1997)
Mach. Learn.
, vol.28
, Issue.1
, pp. 41-75
-
-
Caruana, R.1
-
7
-
-
0022681148
-
How not to lie with statistics: The correct way to summarize benchmark results
-
March
-
Philip J. Fleming and John J. Wallace. How not to lie with statistics: The correct way to summarize benchmark results. Commun. ACM, 29(3):218–221, March 1986.
-
(1986)
Commun. ACM
, vol.29
, Issue.3
, pp. 218-221
-
-
Fleming, P.J.1
Wallace, J.J.2
-
9
-
-
84937779024
-
Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning
-
Xiaoxiao Guo, Satinder P. Singh, Honglak Lee, Richard L. Lewis, and Xiaoshi Wang. Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning. In Advances in Neural Information Processing Systems (NIPS), pages 3338–3346, 2014.
-
(2014)
Advances in Neural Information Processing Systems (NIPS)
, pp. 3338-3346
-
-
Guo, X.1
Singh, S.P.2
Lee, H.3
Lewis, R.L.4
Wang, X.5
-
11
-
-
33750293964
-
Bandit based Monte-Carlo planning
-
Springer
-
Levente Kocsis and Csaba Szepesvári. Bandit based Monte-Carlo planning. In Machine Learning: ECML 2006, pages 282–293. Springer, 2006.
-
(2006)
Machine Learning: ECML 2006
, pp. 282-293
-
-
Kocsis, L.1
Szepesvári, C.2
-
13
-
-
84910035297
-
Learning small-size DNN with output-distribution-based criteria
-
Jinyu Li, Rui Zhao, Jui-Ting Huang, and Yifan Gong. Learning small-size DNN with output-distribution-based criteria. In Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Li, J.1
Zhao, R.2
Huang, J.-T.3
Gong, Y.4
-
15
-
-
65249159583
-
Improving supervised learning by adapting the problem to the learner
-
Joshua Menke and Tony Martinez. Improving supervised learning by adapting the problem to the learner. International Journal of Neural Systems, 19(01):1–9, 2009.
-
(2009)
International Journal of Neural Systems
, vol.19
, Issue.1
, pp. 1-9
-
-
Menke, J.1
Martinez, T.2
-
16
-
-
84904867557
-
Playing Atari with deep reinforcement learning
-
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin A. Riedmiller. Playing Atari with deep reinforcement learning. Deep Learning Workshop, NIPS, 2013.
-
(2013)
Deep Learning Workshop, NIPS
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Graves, A.4
Antonoglou, I.5
Wierstra, D.6
Riedmiller, M.A.7
-
17
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
02
-
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. Human-level control through deep reinforcement learning. Nature, 518(7540):529–533, 02 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
18
-
-
85007207440
-
Massively parallel methods for deep reinforcement learning
-
Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu, and David Silver. Massively parallel methods for deep reinforcement learning. CoRR, abs/1507.04296, 2015.
-
(2015)
CoRR
-
-
Nair, A.1
Srinivasan, P.2
Blackwell, S.3
Alcicek, C.4
Fearon, R.5
De Maria, A.6
Panneershelvam, V.7
Suleyman, M.8
Beattie, C.9
Petersen, S.10
Legg, S.11
Mnih, V.12
Kavukcuoglu, K.13
Silver, D.14
-
19
-
-
84964544562
-
-
arXiv preprint
-
Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, and Yoshua Bengio. Fitnets: Hints for thin deep nets. arXiv preprint arXiv:1412.6550, 2014.
-
(2014)
Fitnets: Hints for Thin Deep Nets
-
-
Romero, A.1
Ballas, N.2
Kahou, S.E.3
Chassang, A.4
Gatta, C.5
Bengio, Y.6
-
22
-
-
0004102479
-
-
MIT Press, Cambridge, MA, USA, 1st edition
-
Richard S. Sutton and Andrew G. Barto. Introduction to Reinforcement Learning. MIT Press, Cambridge, MA, USA, 1st edition, 1998.
-
(1998)
Introduction to Reinforcement Learning
-
-
Sutton, R.S.1
Barto, A.G.2
-
27
-
-
84986224754
-
-
arXiv preprint
-
Dong Wang, Chao Liu, Zhiyuan Tang, Zhiyong Zhang, and Mengyuan Zhao. Recurrent neural network training with dark knowledge transfer. arXiv preprint arXiv:1505.04630, 2015.
-
(2015)
Recurrent Neural Network Training with Dark Knowledge Transfer
-
-
Wang, D.1
Liu, C.2
Tang, Z.3
Zhang, Z.4
Zhao, M.5