-
2
-
-
84940795121
-
Memory trace replay: The shaping of memory consolidation by neuromodulation
-
Atherton, Laura A, Dupret, David, and Mellor, Jack R. Memory trace replay: the shaping of memory consolidation by neuromodulation. Trends in neurosciences, 38(9):560–570, 2015.
-
(2015)
Trends in Neurosciences
, vol.38
, Issue.9
, pp. 560-570
-
-
Atherton, L.A.1
Dupret, D.2
Mellor, J.R.3
-
3
-
-
84879678310
-
-
arXiv preprint
-
Bellemare, Marc G, Naddaf, Yavar, Veness, Joel, and Bowling, Michael. The arcade learning environment: An evaluation platform for general agents. arXiv preprint arXiv:1207.4708, 2012.
-
(2012)
The Arcade Learning Environment: An Evaluation Platform for General Agents
-
-
Bellemare, M.G.1
Naddaf, Y.2
Veness, J.3
Bowling, M.4
-
4
-
-
85007236718
-
Increasing the action gap: New operators for reinforcement learning
-
Bellemare, Marc G., Ostrovski, Georg, Guez, Arthur, Thomas, Philip S., and Munos, Rémi. Increasing the action gap: New operators for reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence, 2016. URL http://arxiv.org/abs/1512.04860.
-
(2016)
Proceedings of the AAAI Conference on Artificial Intelligence
-
-
Bellemare, M.G.1
Ostrovski, G.2
Guez, A.3
Thomas, P.S.4
Munos, R.5
-
5
-
-
84888340666
-
Torch7: A matlab-like environment for machine learning
-
number EPFL-CONF-192376
-
Collobert, Ronan, Kavukcuoglu, Koray, and Farabet, Clément. Torch7: A matlab-like environment for machine learning. In BigLearn, NIPS Workshop, number EPFL-CONF-192376, 2011.
-
(2011)
BigLearn, NIPS Workshop
-
-
Collobert, R.1
Kavukcuoglu, K.2
Farabet, C.3
-
6
-
-
21844491206
-
Zebras and the Anna Karenina principle
-
Diamond, Jared. Zebras and the Anna Karenina principle. Natural History, 103:4–4, 1994.
-
(1994)
Natural History
, vol.103
, pp. 4
-
-
Diamond, J.1
-
7
-
-
51949101231
-
A discriminatively trained, multi-scale, deformable part model
-
Felzenszwalb, Pedro, McAllester, David, and Ramanan, Deva. A discriminatively trained, multi-scale, deformable part model. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pp. 1–8. IEEE, 2008.
-
(2008)
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
, pp. 1-8
-
-
Felzenszwalb, P.1
McAllester, D.2
Ramanan, D.3
-
8
-
-
33645458694
-
Reverse replay of behavioural sequences in hippocampal place cells during the awake state
-
Foster, David J and Wilson, Matthew A. Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature, 440(7084):680–683, 2006.
-
(2006)
Nature
, vol.440
, Issue.7084
, pp. 680-683
-
-
Foster, D.J.1
Wilson, M.A.2
-
9
-
-
84862515469
-
A review on ensembles for the class imbalance problem: Bagging-, boosting-, and hybrid-based approaches
-
Galar, Mikel, Fernandez, Alberto, Barrenechea, Edurne, Bustince, Humberto, and Herrera, Francisco. A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, 42(4):463–484, 2012.
-
(2012)
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
, vol.42
, Issue.4
, pp. 463-484
-
-
Galar, M.1
Fernandez, A.2
Barrenechea, E.3
Bustince, H.4
Herrera, F.5
-
10
-
-
80053456360
-
Online discovery of feature dependencies
-
Geramifard, Alborz, Doshi, Finale, Redding, Joshua, Roy, Nicholas, and How, Jonathan. Online discovery of feature dependencies. In Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 881–888, 2011.
-
(2011)
Proceedings of the 28th International Conference on Machine Learning (ICML-11)
, pp. 881-888
-
-
Geramifard, A.1
Doshi, F.2
Redding, J.3
Roy, N.4
How, J.5
-
11
-
-
84937779024
-
Deep learning for real-time atari game play using offline Monte-carlo tree search planning
-
Ghahra-mani, Z., Welling, M., Cortes, C., Lawrence, and Weinberger, K.Q. (eds), Curran Associates, Inc
-
Guo, Xiaoxiao, Singh, Satinder, Lee, Honglak, Lewis, Richard L, and Wang, Xiaoshi. Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning. In Ghahra-mani, Z., Welling, M., Cortes, C., Lawrence, N.D., and Weinberger, K.Q. (eds.), Advances in Neural Information Processing Systems 27, pp. 3338–3346. Curran Associates, Inc., 2014.
-
(2014)
Advances in Neural Information Processing Systems
, vol.27
, pp. 3338-3346
-
-
Guo, X.1
Singh, S.2
Lee, H.3
Lewis, R.L.4
Wang, X.5
-
12
-
-
34848816179
-
To recognize shapes, first learn to generate images
-
Hinton, Geoffrey E. To recognize shapes, first learn to generate images. Progress in brain research, 165:535–547, 2007.
-
(2007)
Progress in Brain Research
, vol.165
, pp. 535-547
-
-
Hinton, G.E.1
-
13
-
-
85083951076
-
ADaM: A method for stochastic optimization
-
Kingma, Diederik P. and Ba, Jimmy. Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014.
-
(2014)
CoRR
-
-
Kingma, D.P.1
Ba, J.2
-
14
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Nov
-
Lecun, Y., Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, Nov 1998. ISSN 0018-9219. doi: 10.1109/5.726791.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
16
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
Lin, Long-Ji. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine learning, 8(3-4):293–321, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 293-321
-
-
Lin, L.-J.1
-
17
-
-
84937883130
-
Weighted importance sampling for off-policy learning with linear function approximation
-
Mahmood, A Rupam, van Hasselt, Hado P, and Sutton, Richard S. Weighted importance sampling for off-policy learning with linear function approximation. In Advances in Neural Information Processing Systems, pp. 3014–3022, 2014.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 3014-3022
-
-
Mahmood, A.R.1
Van Hasselt, H.P.2
Sutton, R.S.3
-
18
-
-
84947899563
-
Dopaminergic neurons promote hippocampal reactivation and spatial memory persistence
-
McNamara, Colin G, Tejero-Cantero, Álvaro, Trouche, Stéphanie, Campo-Urriza, Natalia, and Dupret, David. Dopaminergic neurons promote hippocampal reactivation and spatial memory persistence. Nature neuroscience, 2014.
-
(2014)
Nature Neuroscience
-
-
McNamara, C.G.1
Tejero-Cantero, Á.2
Trouche, S.3
Campo-Urriza, N.4
Dupret, D.5
-
19
-
-
84904867557
-
-
arXiv preprint
-
Mnih, Volodymyr, Kavukcuoglu, Koray, Silver, David, Graves, Alex, Antonoglou, Ioannis, Wierstra, Daan, and Riedmiller, Martin. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013.
-
(2013)
Playing Atari with Deep Reinforcement Learning
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Graves, A.4
Antonoglou, I.5
Wierstra, D.6
Riedmiller, M.7
-
20
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
Mnih, Volodymyr, Kavukcuoglu, Koray, Silver, David, Rusu, Andrei A, Veness, Joel, Bellemare, Marc G, Graves, Alex, Riedmiller, Martin, Fidjeland, Andreas K, Ostrovski, Georg, Petersen, Stig, Beattie, Charles, Sadik, Amir, Antonoglou, Ioannis, King, Helen, Kumaran, Dharshan, Wierstra, Daan, Legg, Shane, and Hassabis, Demis. Human-level control through deep reinforcement learning. Nature, 518(7540):529–533, 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
21
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less time
-
Moore, Andrew W and Atkeson, Christopher G. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13(1):103–130, 1993.
-
(1993)
Machine Learning
, vol.13
, Issue.1
, pp. 103-130
-
-
Moore, A.W.1
Atkeson, C.G.2
-
22
-
-
84980007683
-
-
arXiv preprint
-
Nair, Arun, Srinivasan, Praveen, Blackwell, Sam, Alcicek, Cagdas, Fearon, Rory, Maria, Alessan-dro De, Panneershelvam, Vedavyas, Suleyman, Mustafa, Beattie, Charles, Petersen, Stig, Legg, Shane, Mnih, Volodymyr, Kavukcuoglu, Koray, and Silver, David. Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv:1507.04296, 2015.
-
(2015)
Massively Parallel Methods for Deep Reinforcement Learning
-
-
Nair, A.1
Srinivasan, P.2
Blackwell, S.3
Alcicek, C.4
Fearon, R.5
De Maria, A.-D.6
Panneershelvam, V.7
Suleyman, M.8
Beattie, C.9
Petersen, S.10
Legg, S.11
Mnih, V.12
Kavukcuoglu, K.13
Silver, D.14
-
24
-
-
84937060789
-
Hippocampal place cells construct reward related sequences through unexplored space
-
Ólafsdóttir, H Freyja, Barry, Caswell, Saleem, Aman B, Hassabis, Demis, and Spiers, Hugo J. Hippocampal place cells construct reward related sequences through unexplored space. Elife, 4: e06063, 2015.
-
(2015)
Elife
, vol.4
-
-
Ólafsdóttir, H.F.1
Barry, C.2
Saleem, A.B.3
Hassabis, D.4
Spiers, H.J.5
-
26
-
-
0031082536
-
New methods for competitive coevolution
-
Rosin, Christopher D and Belew, Richard K. New methods for competitive coevolution. Evolutionary Computation, 5(1):1–29, 1997.
-
(1997)
Evolutionary Computation
, vol.5
, Issue.1
, pp. 1-29
-
-
Rosin, C.D.1
Belew, R.K.2
-
27
-
-
84897487847
-
No more pesky learning rates
-
Schaul, Tom, Zhang, Sixin, and Lecun, Yann. No more pesky learning rates. In Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp. 343–351, 2013.
-
(2013)
Proceedings of the 30th International Conference on Machine Learning (ICML-13)
, pp. 343-351
-
-
Schaul, T.1
Zhang, S.2
Lecun, Y.3
-
29
-
-
72149101860
-
Rewarded outcomes enhance reactivation of experience in the hippocampus
-
Singer, Annabelle C and Frank, Loren M. Rewarded outcomes enhance reactivation of experience in the hippocampus. Neuron, 64(6):910–921, 2009.
-
(2009)
Neuron
, vol.64
, Issue.6
, pp. 910-921
-
-
Singer, A.C.1
Frank, L.M.2
-
31
-
-
80053457849
-
Incremental basis construction from temporal difference error
-
Sun, Yi, Ring, Mark, Schmidhuber, Jürgen, and Gomez, Faustino J. Incremental basis construction from temporal difference error. In Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 481–488, 2011.
-
(2011)
Proceedings of the 28th International Conference on Machine Learning (ICML-11)
, pp. 481-488
-
-
Sun, Y.1
Ring, M.2
Schmidhuber, J.3
Gomez, F.J.4
|