-
1
-
-
0001578518
-
A learning algorithm for boltzmann machines
-
Ackley, David H, Hinton, Geoffrey E, and Sejnowski, Terrence J. A learning algorithm for boltzmann machines. Cognitive science, 9(1): 147-169, 1985.
-
(1985)
Cognitive Science
, vol.9
, Issue.1
, pp. 147-169
-
-
Ackley, D.H.1
Hinton, G.E.2
Sejnowski, T.J.3
-
2
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Bengio, Yoshua, Lamblin, Pascal, Popovici, Dan, Larochelle, Hugo, et al. Greedy layer-wise training of deep networks. In Advances in Neural Information Processing Systems (NIPS), 2007.
-
(2007)
Advances in Neural Information Processing Systems (NIPS)
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
3
-
-
84882266451
-
Better mixing via deep representations
-
Bengio, Yoshua, Mesnil, Grégoire, Dauphin, Yann, and Rifai, Salah. Better mixing via deep representations. In Proceedings of the 28th International Conference on Machine Learning (ICML), 2013a.
-
(2013)
Proceedings of the 28th International Conference on Machine Learning (ICML)
-
-
Bengio, Y.1
Mesnil, G.2
Dauphin, Y.3
Rifai, S.4
-
4
-
-
84899017362
-
Generalized denoising auto-encoders as generative models
-
Bengio, Yoshua, Yao, Li, Alain, Guillaume, and Vincent, Pascal. Generalized denoising auto-encoders as generative models. In Advances in Neural Information Processing Systems, pp. 899-907, 2013b.
-
(2013)
Advances in Neural Information Processing Systems
, pp. 899-907
-
-
Bengio, Y.1
Yao, L.2
Alain, G.3
Vincent, P.4
-
5
-
-
84919906761
-
Deep generative stochastic networks trainable by backprop
-
Bengio, Yoshua, Thibodeau-Laufer, Eric, Alain, Guillaume, and Yosinski, Jason. Deep generative stochastic networks trainable by backprop. In Proceedings of the 29th International Conference on Machine Learning (ICML), 2014.
-
(2014)
Proceedings of the 29th International Conference on Machine Learning (ICML)
-
-
Bengio, Y.1
Thibodeau-Laufer, E.2
Alain, G.3
Yosinski, J.4
-
6
-
-
84961291190
-
Learning phrase representations using rnn encoderdecoder for statistical machine translation
-
Cho, Kyunghyun, van Merrienboer, Bart, Gulcehre, Caglar, Bougares, Fethi, Schwenk, Holger, and Bengio, Yoshua. Learning phrase representations using rnn encoderdecoder for statistical machine translation. In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014.
-
(2014)
Conference on Empirical Methods in Natural Language Processing (EMNLP)
-
-
Cho, K.1
Van Merrienboer, B.2
Gulcehre, C.3
Bougares, F.4
Schwenk, H.5
Bengio, Y.6
-
8
-
-
84944115860
-
-
arXiv preprint arXiv:1411.4952
-
Fang, Hao, Gupta, Saurabh, Iandola, Forrest, Srivastava, Rupesh, Deng, Li, Dollár, Piotr, Gao, Jianfeng, He, Xiaodong, Mitchell, Margaret, Piatt, John, Zitnick, C. Lawrence, and Zweig, Geoffrey. From captions to visual concepts and back. arXiv preprint arXiv:1411.4952, 2014.
-
(2014)
From Captions to Visual Concepts and Back
-
-
Fang, H.1
Gupta, S.2
Iandola, F.3
Srivastava, R.4
Deng, L.5
Dollár, P.6
Gao, J.7
He, X.8
Mitchell, M.9
Piatt, J.10
Zitnick, C.L.11
Zweig, G.12
-
9
-
-
84937849144
-
Generative adversarial nets
-
Goodfellow, Ian, Pouget-Abadie, Jean, Mirza, Mehdi, Xu, Bing, Warde-Farley, David, Ozair, Sherjil, Courville, Aaron, and Bengio, Yoshua. Generative adversarial nets. In Advances in Neural Information Processing Systems, pp. 2672-2680, 2014.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 2672-2680
-
-
Goodfellow, I.1
Pouget-Abadie, J.2
Mirza, M.3
Xu, B.4
Warde-Farley, D.5
Ozair, S.6
Courville, A.7
Bengio, Y.8
-
11
-
-
84864063983
-
A kernel method for the two-sample-problem
-
Gretton, Arthur, Borgwardt, Karsten M, Rasch, Malte, Schölkopf, Bernhard, and Smola, Alex J. A kernel method for the two-sample-problem. In Advances in Neural Information Processing Systems (NIPS), 2007.
-
(2007)
Advances in Neural Information Processing Systems (NIPS)
-
-
Gretton, A.1
Borgwardt, K.M.2
Rasch, M.3
Schölkopf, B.4
Smola, A.J.5
-
12
-
-
84859477054
-
A kernel two-sample test
-
Gretton, Arthur, Borgwardt, Karsten M, Rasch, Malte J, Schölkopf, Bernhard, and Smola, Alexander. A kernel two-sample test. The Journal of Machine Learning Research, 13(1):723-773, 2012a.
-
(2012)
The Journal of Machine Learning Research
, vol.13
, Issue.1
, pp. 723-773
-
-
Gretton, A.1
Borgwardt, K.M.2
Rasch, M.J.3
Schölkopf, B.4
Smola, A.5
-
13
-
-
84877753617
-
Optimal kernel choice for large-scale two-sample tests
-
Gretton, Arthur, Sejdinovic, Dino, Strathmann, Heiko, Balakrishnan, Sivaraman, Pontil, Massimiliano, Fukumizu, Kenji, and Sriperumbudur, Bharath K. Optimal kernel choice for large-scale two-sample tests. In Advances in Neural Information Processing Systems, pp. 1205-1213, 2012b.
-
(2012)
Advances in Neural Information Processing Systems
, pp. 1205-1213
-
-
Gretton, A.1
Sejdinovic, D.2
Strathmann, H.3
Balakrishnan, S.4
Pontil, M.5
Fukumizu, K.6
Sriperumbudur, B.K.7
-
14
-
-
0013344078
-
Training products of experts by minimizing contrastive divergence
-
Hinton, Geoffrey E. Training products of experts by minimizing contrastive divergence. Neural Computation, 14 (8):1771-1800, 2002.
-
(2002)
Neural Computation
, vol.14
, Issue.8
, pp. 1771-1800
-
-
Hinton, G.E.1
-
15
-
-
0029652445
-
The "wake-sleep" algorithm for unsupervised neural networks
-
Hinton, Geoffrey E, Dayan, Peter, Frey, Brendan J, and Neal, Radford M. The "wake-sleep" algorithm for unsupervised neural networks. Science, 268(5214):1158-1161, 1995.
-
(1995)
Science
, vol.268
, Issue.5214
, pp. 1158-1161
-
-
Hinton, G.E.1
Dayan, P.2
Frey, B.J.3
Neal, R.M.4
-
16
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Hinton, Geoffrey E., Deng, Li, Yu, Dong, Dahl, George E., Mohamed, Abdel-Rahman, Jaitly, Navdeep, Senior, Andrew, Vanhoucke, Vincent, Nguyen, Patrick, Sainath, Tara N., and Kingsbury, Brian. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Process. Mag., 29(6):82-97, 2012a.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.E.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
17
-
-
84867720412
-
-
arXiv preprint arXiv:1207.0580
-
Hinton, Geoffrey E, Srivastava, Nitish, Krizhevsky, Alex, Sutskever, Ilya, and Salakhutdinov, Ruslan R. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580, 2012b.
-
(2012)
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.R.5
-
22
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
LeCun, Yann, Bottou, Leon, Bengio, Yoshua, and Haffner, Patrick. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
23
-
-
33645246465
-
Bayesian neural networks and density networks
-
MacKay, David JC. Bayesian neural networks and density networks. Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, 354(1):73-80, 1995.
-
(1995)
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment
, vol.354
, Issue.1
, pp. 73-80
-
-
MacKay, D.J.C.1
-
24
-
-
84899007861
-
Neural networks for density estimation
-
Magdon-Ismail, Malik and Atiya, Amir. Neural networks for density estimation. In NIPS, pp. 522-528, 1998.
-
(1998)
NIPS
, pp. 522-528
-
-
Magdon-Ismail, M.1
Atiya, A.2
-
25
-
-
85006718556
-
A winner-take-all method for training sparse convolutional autoencoders
-
Makhzani, Alireza and Frey, Brendan. A winner-take-all method for training sparse convolutional autoencoders. In NIPS Deep Learning Workshop, 2014.
-
(2014)
NIPS Deep Learning Workshop
-
-
Makhzani, A.1
Frey, B.2
-
26
-
-
79959353548
-
Stacked convolutional auto-encoders for hierarchical feature extraction
-
Springer
-
Masci, Jonathan, Meier, Ueli, Cireşan, Dan, and Schmidhuber, Jürgen. Stacked convolutional auto-encoders for hierarchical feature extraction. In Artificial Neural Networks and Machine Learning-ICANN 2011, pp. 52-59. Springer, 2011.
-
(2011)
Artificial Neural Networks and Machine Learning-ICANN 2011
, pp. 52-59
-
-
Masci, J.1
Meier, U.2
Cireşan, D.3
Schmidhuber, J.4
-
29
-
-
44049116681
-
Connectionist learning of belief networks
-
Neal, Radford M. Connectionist learning of belief networks. Artificial intelligence, 56(1):71-113, 1992.
-
(1992)
Artificial Intelligence
, vol.56
, Issue.1
, pp. 71-113
-
-
Neal, R.M.1
-
31
-
-
84961233903
-
On the decreasing power of kernel and distance based nonparametric hypothesis tests in high dimensions
-
Ramdas, Aaditya, Reddi, Sashank J, Poczos, Barnabas, Singh, Aarti, and Wasserman, Larry. On the decreasing power of kernel and distance based nonparametric hypothesis tests in high dimensions. In The Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15), 2015.
-
(2015)
The Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15)
-
-
Ramdas, A.1
Reddi, S.J.2
Poczos, B.3
Singh, A.4
Wasserman, L.5
-
32
-
-
84919796093
-
Stochastic backpropagation and approximate inference in deep generative models
-
Rezende, Danilo Jimenez, Mohamed, Shakir, and Wierstra, Daan. Stochastic backpropagation and approximate inference in deep generative models. In International Conference on Machine Learning, pp. 1278-1286, 2014.
-
(2014)
International Conference on Machine Learning
, pp. 1278-1286
-
-
Rezende, D.J.1
Mohamed, S.2
Wierstra, D.3
-
33
-
-
80053460450
-
Contractive auto-encoders: Explicit invariance during feature extraction
-
Rifai, Salah, Vincent, Pascal, Muller, Xavier, Glorot, Xavier, and Bengio, Yoshua. Contractive auto-encoders: Explicit invariance during feature extraction. In Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 833-840, 2011.
-
(2011)
Proceedings of the 28th International Conference on Machine Learning (ICML-11)
, pp. 833-840
-
-
Rifai, S.1
Vincent, P.2
Muller, X.3
Glorot, X.4
Bengio, Y.5
-
34
-
-
84867136416
-
A generative process for sampling contractive auto-encoders
-
Rifai, Salah, Bengio, Yoshua, Dauphin, Yann, and Vincent, Pascal. A generative process for sampling contractive auto-encoders. In International Conference on Machine Learning (ICML), 2012.
-
(2012)
International Conference on Machine Learning (ICML)
-
-
Rifai, S.1
Bengio, Y.2
Dauphin, Y.3
Vincent, P.4
-
36
-
-
85083951635
-
Overfeat: Integrated recognition, localization and detection using convolutional networks
-
Sermanet, Pierre, Eigen, David, Zhang, Xiang, Mathieu, Michaël, Fergus, Rob, and LeCun, Yann. Overfeat: Integrated recognition, localization and detection using convolutional networks. In International Conference on Learning Representations, 2014.
-
(2014)
International Conference on Learning Representations
-
-
Sermanet, P.1
Eigen, D.2
Zhang, X.3
Mathieu, M.4
Fergus, R.5
LeCun, Y.6
-
38
-
-
84919794855
-
Input warping for Bayesian optimization of non-stationary functions
-
Snoek, Jasper, Swersky, Kevin, Zemel, Richard S., and Adams, Ryan P. Input warping for Bayesian optimization of non-stationary functions. In International Conference on Machine Learning, 2014.
-
(2014)
International Conference on Machine Learning
-
-
Snoek, J.1
Swersky, K.2
Zemel, R.S.3
Adams, R.P.4
-
39
-
-
77951951390
-
Kernel choice and classifiability for rkhs embeddings of probability distributions
-
Sriperumbudur, Bharath K, Fukumizu, Kenji, Gretton, Arthur, Lanckriet, Gert RG, and Schölkopf, Bernhard. Kernel choice and classifiability for rkhs embeddings of probability distributions. In Advances in Neural Information Processing Systems, pp. 1750-1758, 2009.
-
(2009)
Advances in Neural Information Processing Systems
, pp. 1750-1758
-
-
Sriperumbudur, B.K.1
Fukumizu, K.2
Gretton, A.3
Lanckriet, G.R.G.4
Schölkopf, B.5
-
40
-
-
84867851372
-
-
Technical report, Department of Computer Science, University of Toronto
-
Susskind, Joshua, Anderson, Adam, and Hinton, Geoffrey E. The toronto face dataset. Technical report, Department of Computer Science, University of Toronto, 2010.
-
(2010)
The Toronto Face Dataset
-
-
Susskind, J.1
Anderson, A.2
Hinton, G.E.3
-
41
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
Sutskever, Ilya, Vinyals, Oriol, and Le, Quoc VV. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems, pp. 3104-3112, 2014.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.V.3
-
42
-
-
84964983441
-
-
arXiv preprint arXiv: 1409.4842
-
Szegedy, Christian, Liu, Wei, Jia, Yangqing, Sermanet, Pierre, Reed, Scott, Anguelov, Dragomir, Erhan, Du-mitru, Vanhoucke, Vincent, and Rabinovich, Andrew. Going deeper with convolutions. arXiv preprint arXiv: 1409.4842, 2014.
-
(2014)
Going Deeper with Convolutions
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.-M.7
Vanhoucke, V.8
Rabinovich, A.9
-
43
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
ACM
-
Vincent, Pascal, Larochelle, Hugo, Bengio, Yoshua, and Manzagol, Pierre-Antoine. Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th international conference on Machine learning, pp. 1096-1103. ACM, 2008.
-
(2008)
Proceedings of the 25th International Conference on Machine Learning
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.-A.4
-
44
-
-
84939821075
-
-
arXiv preprint arXiv:1411.4555
-
Vinyals, Oriol, Toshev, Alexander, Bengio, Samy, and Er-han, Dumitru. Show and tell: A neural image caption generator. arXiv preprint arXiv:1411.4555, 2014.
-
(2014)
Show and Tell: A Neural Image Caption Generator
-
-
Vinyals, O.1
Toshev, A.2
Bengio, S.3
Er-Han, D.4
-
45
-
-
77956001004
-
Deconvolutional networks
-
IEEE
-
Zeiler, Matthew D, Krishnan, Dilip, Taylor, Graham W, and Fergus, Robert. Deconvolutional networks. In Computer Vision and Pattern Recognition, pp. 2528-2535. IEEE, 2010.
-
(2010)
Computer Vision and Pattern Recognition
, pp. 2528-2535
-
-
Zeiler, M.D.1
Krishnan, D.2
Taylor, G.W.3
Fergus, R.4
-
46
-
-
84897542525
-
Learning fair representations
-
Zemel, Richard, Wu, Yu, Swersky, Kevin, Pitassi, Toni, and Dwork, Cynthia. Learning fair representations. In International Conference on Machine Learning, pp. 325-333, 2013.
-
(2013)
International Conference on Machine Learning
, pp. 325-333
-
-
Zemel, R.1
Wu, Y.2
Swersky, K.3
Pitassi, T.4
Dwork, C.5
|