-
2
-
-
0027599793
-
Universal approximation bounds for superpositions of a sigmoidal function
-
Barron, A.E.: Universal approximation bounds for superpositions of a sigmoidal function. IEEE Trans. on Information Theory 39, 930-945 (1993)
-
(1993)
IEEE Trans. on Information Theory
, vol.39
, pp. 930-945
-
-
Barron, A.E.1
-
3
-
-
69349090197
-
Learning deep architectures for AI
-
Also published as a book. Now Publishers
-
Bengio, Y.: Learning deep architectures for AI. Foundations and Trends in Machine Learning 2(1), 1-127 (2009); Also published as a book. Now Publishers
-
(2009)
Foundations and Trends in Machine Learning
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
5
-
-
67651049775
-
Justifying and generalizing contrastive divergence
-
Bengio, Y., Delalleau, O.: Justifying and generalizing contrastive divergence. Neural Computation 21(6), 1601-1621 (2009)
-
(2009)
Neural Computation
, vol.21
, Issue.6
, pp. 1601-1621
-
-
Bengio, Y.1
Delalleau, O.2
-
7
-
-
34547975052
-
Scaling learning algorithms towards AI
-
Bottou, L., Chapelle, O., DeCoste, D., Weston, J. (eds.) MIT Press, Cambridge
-
Bengio, Y., LeCun, Y.: Scaling learning algorithms towards AI. In: Bottou, L., Chapelle, O., DeCoste, D., Weston, J. (eds.) Large Scale Kernel Machines. MIT Press, Cambridge (2007)
-
(2007)
Large Scale Kernel Machines
-
-
Bengio, Y.1
LeCun, Y.2
-
8
-
-
84898947097
-
Non-local manifold tangent learning
-
Saul, L., Weiss, Y., Bottou, L. (eds.) MIT Press, Cambridge
-
Bengio, Y., Monperrus, M.: Non-local manifold tangent learning. In: Saul, L., Weiss, Y., Bottou, L. (eds.) Advances in Neural Information Processing Systems, NIPS 2004, vol. 17, pp. 129-136. MIT Press, Cambridge (2005)
-
(2005)
Advances in Neural Information Processing Systems, NIPS 2004
, vol.17
, pp. 129-136
-
-
Bengio, Y.1
Monperrus, M.2
-
9
-
-
77954662106
-
The curse of highly variable functions for local kernel machines
-
Weiss, Y., Schölkopf, B., Platt, J. (eds.) MIT Press, Cambridge
-
Bengio, Y., Delalleau, O., Le Roux, N.: The curse of highly variable functions for local kernel machines. In: Weiss, Y., Schölkopf, B., Platt, J. (eds.) Advances in Neural Information Processing Systems (NIPS 2005), vol. 18, pp. 107-114. MIT Press, Cambridge (2006)
-
(2006)
Advances in Neural Information Processing Systems (NIPS 2005)
, vol.18
, pp. 107-114
-
-
Bengio, Y.1
Delalleau, O.2
Le Roux, N.3
-
10
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Schölkopf, B., Platt, J., Hoffman, T. (eds.) MIT Press, Cambridge
-
Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. In: Schölkopf, B., Platt, J., Hoffman, T. (eds.) Advances in Neural Information Processing Systems (NIPS 2006), vol. 19, pp. 153-160. MIT Press, Cambridge (2007)
-
(2007)
Advances in Neural Information Processing Systems (NIPS 2006)
, vol.19
, pp. 153-160
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
11
-
-
78649265006
-
Decision trees do not generalize to new variations
-
Bengio, Y., Delalleau, O., Simard, C.: Decision trees do not generalize to new variations. Computational Intelligence 26(4), 449-467 (2010)
-
(2010)
Computational Intelligence
, vol.26
, Issue.4
, pp. 449-467
-
-
Bengio, Y.1
Delalleau, O.2
Simard, C.3
-
12
-
-
84862302263
-
Deep learners benefit more from out-of-distribution examples
-
Bengio, Y., Bastien, F., Bergeron, A., Boulanger-Lewandowski, N., Breuel, T., Chherawala, Y., Cisse, M., Côté, M., Erhan, D., Eustache, J., Glorot, X., Muller, X., Pannetier Lebeuf, S., Pascanu, R., Rifai, S., Savard, F., Sicard, G.: Deep learners benefit more from out-of-distribution examples. In: JMLR W&CP: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011 (2011)
-
JMLR W&CP: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011 (2011)
-
-
Bengio, Y.1
Bastien, F.2
Bergeron, A.3
Boulanger-Lewandowski, N.4
Breuel, T.5
Chherawala, Y.6
Cisse, M.7
Côté, M.8
Erhan, D.9
Eustache, J.10
Glorot, X.11
Muller, X.12
Pannetier Lebeuf, S.13
Pascanu, R.14
Rifai, S.15
Savard, F.16
Sicard, G.17
-
13
-
-
0024220237
-
Auto-association by multilayer perceptrons and singular value decomposition
-
Bourlard, H., Kamp, Y.: Auto-association by multilayer perceptrons and singular value decomposition. Biological Cybernetics 59, 291-294 (1988)
-
(1988)
Biological Cybernetics
, vol.59
, pp. 291-294
-
-
Bourlard, H.1
Kamp, Y.2
-
14
-
-
79953693034
-
Poly-logarithmic independence fools bounded-depth boolean circuits
-
Braverman, M.: Poly-logarithmic independence fools bounded-depth boolean circuits. Communications of the ACM 54(4), 108-115 (2011)
-
(2011)
Communications of the ACM
, vol.54
, Issue.4
, pp. 108-115
-
-
Braverman, M.1
-
15
-
-
79959650504
-
Quickly generating representative samples from an rbm-derived process
-
Breuleux, O., Bengio, Y., Vincent, P.: Quickly generating representative samples from an rbm-derived process. Neural Computation 23(8), 2058-2073 (2011)
-
(2011)
Neural Computation
, vol.23
, Issue.8
, pp. 2058-2073
-
-
Breuleux, O.1
Bengio, Y.2
Vincent, P.3
-
16
-
-
0005594495
-
Signature verification using a siamese time delay neural network
-
World Scientific, Singapore
-
Bromley, J., Benz, J., Bottou, L., Guyon, I., Jackel, L., LeCun, Y., Moore, C., Sackinger, E., Shah, R.: Signature verification using a siamese time delay neural network. In: Advances in Pattern Recognition Systems using Neural Network Technologies, pp. 669-687. World Scientific, Singapore (1993)
-
(1993)
Advances in Pattern Recognition Systems Using Neural Network Technologies
, pp. 669-687
-
-
Bromley, J.1
Benz, J.2
Bottou, L.3
Guyon, I.4
Jackel, L.5
LeCun, Y.6
Moore, C.7
Sackinger, E.8
Shah, R.9
-
17
-
-
85153936556
-
Learning many related tasks at the same time with backpropagation
-
Tesauro, G., Touretzky, D., Leen, T. (eds.) MIT Press, Cambridge
-
Caruana, R.: Learning many related tasks at the same time with backpropagation. In: Tesauro, G., Touretzky, D., Leen, T. (eds.) Advances in Neural Information Processing Systems (NIPS 1994), vol. 7, pp. 657-664. MIT Press, Cambridge (1995)
-
(1995)
Advances in Neural Information Processing Systems (NIPS 1994)
, vol.7
, pp. 657-664
-
-
Caruana, R.1
-
18
-
-
24644436425
-
Learning a similarity metric discriminatively, with application to face verification
-
IEEE Press, Los Alamitos
-
Chopra, S., Hadsell, R., LeCun, Y.: Learning a similarity metric discriminatively, with application to face verification. In: Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR 2005). IEEE Press, Los Alamitos (2005)
-
(2005)
Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR 2005)
-
-
Chopra, S.1
Hadsell, R.2
LeCun, Y.3
-
19
-
-
56449095373
-
A unified architecture for natural language processing: Deep neural networks with multitask learning
-
Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) ACM, New York
-
Collobert, R., Weston, J.: A unified architecture for natural language processing: Deep neural networks with multitask learning. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008), pp. 160-167. ACM, New York (2008)
-
(2008)
Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008)
, pp. 160-167
-
-
Collobert, R.1
Weston, J.2
-
20
-
-
84862293204
-
Tempered Markov chain monte carlo for training of restricted Boltzmann machine
-
Desjardins, G., Courville, A., Bengio, Y., Vincent, P., Delalleau, O.: Tempered Markov chain monte carlo for training of restricted Boltzmann machine. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010), pp. 145-152 (2010)
-
(2010)
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010)
, pp. 145-152
-
-
Desjardins, G.1
Courville, A.2
Bengio, Y.3
Vincent, P.4
Delalleau, O.5
-
21
-
-
77949522811
-
Why does unsupervised pre-training help deep learning?
-
Erhan, D., Bengio, Y., Courville, A., Manzagol, P.-A., Vincent, P., Bengio, S.: Why does unsupervised pre-training help deep learning? Journal of Machine Learning Research 11, 625-660 (2010)
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 625-660
-
-
Erhan, D.1
Bengio, Y.2
Courville, A.3
Manzagol, P.-A.4
Vincent, P.5
Bengio, S.6
-
23
-
-
80053443013
-
Domain adaptation for large-scale sentiment classification: A deep learning approach
-
Glorot, X., Bordes, A., Bengio, Y.: Domain adaptation for large-scale sentiment classification: A deep learning approach. In: Proceedings of the Twenty-eight International Conference on Machine Learning, ICML 2011 (2011)
-
Proceedings of the Twenty-eight International Conference on Machine Learning, ICML 2011 (2011)
-
-
Glorot, X.1
Bordes, A.2
Bengio, Y.3
-
24
-
-
84860644702
-
Measuring invariances in deep networks
-
Bengio, Y., Schuurmans, D., Williams, C., Lafferty, J., Culotta, A. (eds.)
-
Goodfellow, I., Le, Q., Saxe, A., Ng, A.: Measuring invariances in deep networks. In: Bengio, Y., Schuurmans, D., Williams, C., Lafferty, J., Culotta, A. (eds.) Advances in Neural Information Processing Systems (NIPS 2009), vol. 22, pp. 646-654 (2009)
-
(2009)
Advances in Neural Information Processing Systems (NIPS 2009)
, vol.22
, pp. 646-654
-
-
Goodfellow, I.1
Le, Q.2
Saxe, A.3
Ng, A.4
-
26
-
-
33845594569
-
Dimensionality reduction by learning an invariant mapping
-
IEEE Press, Los Alamitos
-
Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR 2006), pp. 1735-1742. IEEE Press, Los Alamitos (2006)
-
(2006)
Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR 2006)
, pp. 1735-1742
-
-
Hadsell, R.1
Chopra, S.2
LeCun, Y.3
-
27
-
-
69549124128
-
Deep belief net learning in a long-range vision system for autonomous off-road driving
-
Hadsell, R., Erkan, A., Sermanet, P., Scoffier, M., Muller, U., LeCun, Y.: Deep belief net learning in a long-range vision system for autonomous off-road driving. In: Proc. Intelligent Robots and Systems (IROS 2008), pp. 628-633 (2008)
-
(2008)
Proc. Intelligent Robots and Systems (IROS 2008)
, pp. 628-633
-
-
Hadsell, R.1
Erkan, A.2
Sermanet, P.3
Scoffier, M.4
Muller, U.5
LeCun, Y.6
-
29
-
-
0001295178
-
On the power of small-depth threshold circuits
-
Håstad, J., Goldmann, M.: On the power of small-depth threshold circuits. Computational Complexity 1, 113-129 (1991)
-
(1991)
Computational Complexity
, vol.1
, pp. 113-129
-
-
Håstad, J.1
Goldmann, M.2
-
31
-
-
0024732792
-
Connectionist learning procedures
-
Hinton, G.E.: Connectionist learning procedures. Artificial Intelligence 40, 185-234 (1989)
-
(1989)
Artificial Intelligence
, vol.40
, pp. 185-234
-
-
Hinton, G.E.1
-
33
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
Hinton, G.E., Salakhutdinov, R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504-507 (2006)
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.2
-
34
-
-
0002834189
-
Autoencoders, minimum description length, and helmholtz free energy
-
Cowan, D., Tesauro, G., Alspector, J. (eds.) Morgan Kaufmann Publishers, Inc., San Francisco
-
Hinton, G.E., Zemel, R.S.: Autoencoders, minimum description length, and helmholtz free energy. In: Cowan, D., Tesauro, G., Alspector, J. (eds.) Advances in Neural Information Processing Systems (NIPS 1993), vol. 6, pp. 3-10. Morgan Kaufmann Publishers, Inc., San Francisco (1994)
-
(1994)
Advances in Neural Information Processing Systems (NIPS 1993)
, vol.6
, pp. 3-10
-
-
Hinton, G.E.1
Zemel, R.S.2
-
35
-
-
0004243089
-
-
Technical Report TR-CMU-CS-84-119, Carnegie-Mellon University, Dept. of Computer Science
-
Hinton, G.E., Sejnowski, T.J., Ackley, D.H.: Boltzmann machines: Constraint satisfaction networks that learn. Technical Report TR-CMU-CS-84-119, Carnegie-Mellon University, Dept. of Computer Science (1984)
-
(1984)
Boltzmann Machines: Constraint Satisfaction Networks That Learn
-
-
Hinton, G.E.1
Sejnowski, T.J.2
Ackley, D.H.3
-
36
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Hinton, G.E., Osindero, S., Teh, Y.: A fast learning algorithm for deep belief nets. Neural Computation 18, 1527-1554 (2006)
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
37
-
-
22044434800
-
Estimation of non-normalized statistical models using score matching
-
Hyvärinen, A.: Estimation of non-normalized statistical models using score matching. Journal of Machine Learning Research 6, 695-709 (2005)
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 695-709
-
-
Hyvärinen, A.1
-
38
-
-
77953183471
-
What is the best multi-stage architecture for object recognition?
-
IEEE, Los Alamitos
-
Jarrett, K., Kavukcuoglu, K., Ranzato, M., LeCun, Y.: What is the best multi-stage architecture for object recognition? In: Proc. International Conference on Computer Vision (ICCV 2009), pp. 2146-2153. IEEE, Los Alamitos (2009)
-
(2009)
Proc. International Conference on Computer Vision (ICCV 2009)
, pp. 2146-2153
-
-
Jarrett, K.1
Kavukcuoglu, K.2
Ranzato, M.3
LeCun, Y.4
-
39
-
-
70049083257
-
-
Technical report, Computational and Biological Learning Lab, Courant Institute, NYU. Tech Report CBLL-TR-2008-12-01
-
Kavukcuoglu, K., Ranzato, M., LeCun, Y.: Fast inference in sparse coding algorithms with applications to object recognition. Technical report, Computational and Biological Learning Lab, Courant Institute, NYU. Tech Report CBLL-TR-2008-12-01 (2008)
-
(2008)
Fast Inference in Sparse Coding Algorithms with Applications to Object Recognition
-
-
Kavukcuoglu, K.1
Ranzato, M.2
LeCun, Y.3
-
40
-
-
70450177775
-
Learning invariant features through topographic filter maps
-
IEEE, Los Alamitos
-
Kavukcuoglu, K., Ranzato, M., Fergus, R., LeCun, Y.: Learning invariant features through topographic filter maps. In: Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR 2009), pp. 1605-1612. IEEE, Los Alamitos (2009)
-
(2009)
Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR 2009)
, pp. 1605-1612
-
-
Kavukcuoglu, K.1
Ranzato, M.2
Fergus, R.3
LeCun, Y.4
-
41
-
-
85162419825
-
Regularized estimation of image statistics by score matching
-
Lafferty, J., Williams, C.K.I., Shawe-Taylor, J., Zemel, R., Culotta, A. (eds.)
-
Kingma, D., LeCun, Y.: Regularized estimation of image statistics by score matching. In: Lafferty, J., Williams, C.K.I., Shawe-Taylor, J., Zemel, R., Culotta, A. (eds.) Advances in Neural Information Processing Systems, vol. 23, pp. 1126-1134 (2010)
-
(2010)
Advances in Neural Information Processing Systems
, vol.23
, pp. 1126-1134
-
-
Kingma, D.1
LeCun, Y.2
-
42
-
-
34547967782
-
An empirical evaluation of deep architectures on problems with many factors of variation
-
Ghahramani, Z. (ed.) ACM, New York
-
Larochelle, H., Erhan, D., Courville, A., Bergstra, J., Bengio, Y.: An empirical evaluation of deep architectures on problems with many factors of variation. In: Ghahramani, Z. (ed.) Proceedings of the Twenty-fourth International Conference on Machine Learning (ICML 2007), pp. 473-480. ACM, New York (2007)
-
(2007)
Proceedings of the Twenty-fourth International Conference on Machine Learning (ICML 2007)
, pp. 473-480
-
-
Larochelle, H.1
Erhan, D.2
Courville, A.3
Bergstra, J.4
Bengio, Y.5
-
43
-
-
84862276188
-
Deep learning using robust interdependent codes
-
Larochelle, H., Erhan, D., Vincent, P.: Deep learning using robust interdependent codes. In: Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS 2009), pp. 312-319 (2009)
-
(2009)
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS 2009)
, pp. 312-319
-
-
Larochelle, H.1
Erhan, D.2
Vincent, P.3
-
44
-
-
45749110924
-
Representational power of restricted Boltzmann machines and deep belief networks
-
Le Roux, N., Bengio, Y.: Representational power of restricted Boltzmann machines and deep belief networks. Neural Computation 20(6), 1631-1649 (2008)
-
(2008)
Neural Computation
, vol.20
, Issue.6
, pp. 1631-1649
-
-
Le Roux, N.1
Bengio, Y.2
-
45
-
-
71149119164
-
Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
-
Bottou, L., Littman, M. (eds.) ACM, Montreal
-
Lee, H., Grosse, R., Ranganath, R., Ng, A.Y.: Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: Bottou, L., Littman, M. (eds.) Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML 2009). ACM, Montreal (2009)
-
(2009)
Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML 2009)
-
-
Lee, H.1
Grosse, R.2
Ranganath, R.3
Ng, A.Y.4
-
46
-
-
0037452922
-
The cost of cortical computation
-
Lennie, P.: The cost of cortical computation. Current Biology 13(6), 493-497 (2003)
-
(2003)
Current Biology
, vol.13
, Issue.6
, pp. 493-497
-
-
Lennie, P.1
-
47
-
-
84906491858
-
Unsupervised and transfer learning challenge: A deep learning approach
-
Mesnil, G., Dauphin, Y., Glorot, X., Rifai, S., Bengio, Y., Goodfellow, I., Lavoie, E., Muller, X., Desjardins, G., Warde-Farley, D., Vincent, P., Courville, A., Bergstra, J.: Unsupervised and transfer learning challenge: a deep learning approach. In: Workshop on Unsupervised and Transfer Learning, ICML 2011 (2011)
-
Workshop on Unsupervised and Transfer Learning, ICML 2011 (2011)
-
-
Mesnil, G.1
Dauphin, Y.2
Glorot, X.3
Rifai, S.4
Bengio, Y.5
Goodfellow, I.6
Lavoie, E.7
Muller, X.8
Desjardins, G.9
Warde-Farley, D.10
Vincent, P.11
Courville, A.12
Bergstra, J.13
-
48
-
-
71149084945
-
Deep learning from temporal coherence in video
-
Bottou, L., Littman, M. (eds.) Omnipress, Montreal
-
Mobahi, H., Collobert, R., Weston, J.: Deep learning from temporal coherence in video. In: Bottou, L., Littman, M. (eds.) Proceedings of the 26th International Conference on Machine Learning, pp. 737-744. Omnipress, Montreal (2009)
-
(2009)
Proceedings of the 26th International Conference on Machine Learning
, pp. 737-744
-
-
Mobahi, H.1
Collobert, R.2
Weston, J.3
-
49
-
-
0030779611
-
Sparse coding with an overcomplete basis set: A strategy employed by V1?
-
Olshausen, B.A., Field, D.J.: Sparse coding with an overcomplete basis set: a strategy employed by V1? Vision Research 37, 3311-3325 (1997)
-
(1997)
Vision Research
, vol.37
, pp. 3311-3325
-
-
Olshausen, B.A.1
Field, D.J.2
-
50
-
-
85161976678
-
Modeling image patches with a directed hierarchy of markov random field
-
Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) MIT Press, Cambridge
-
Osindero, S., Hinton, G.E.: Modeling image patches with a directed hierarchy of markov random field. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems (NIPS 2007), vol. 20, pp. 1121-1128. MIT Press, Cambridge (2008)
-
(2008)
Advances in Neural Information Processing Systems (NIPS 2007)
, vol.20
, pp. 1121-1128
-
-
Osindero, S.1
Hinton, G.E.2
-
51
-
-
80054123592
-
Sum-product networks: A new deep architecture
-
Poon, H., Domingos, P.: Sum-product networks: A new deep architecture. In: NIPS, Workshop on Deep Learning and Unsupervised Feature Learning, Whistler, Canada (2010)
-
NIPS, Workshop on Deep Learning and Unsupervised Feature Learning, Whistler, Canada (2010)
-
-
Poon, H.1
Domingos, P.2
-
53
-
-
84864069017
-
Efficient learning of sparse representations with an energy-based model
-
Schölkopf, B., Platt, J., Hoffman, T. (eds.) MIT Press, Cambridge
-
Ranzato, M., Poultney, C., Chopra, S., LeCun, Y.: Efficient learning of sparse representations with an energy-based model. In: Schölkopf, B., Platt, J., Hoffman, T. (eds.) Advances in Neural Information Processing Systems (NIPS 2006), vol. 19, pp. 1137-1144. MIT Press, Cambridge (2007)
-
(2007)
Advances in Neural Information Processing Systems (NIPS 2006)
, vol.19
, pp. 1137-1144
-
-
Ranzato, M.1
Poultney, C.2
Chopra, S.3
LeCun, Y.4
-
54
-
-
85161966246
-
Sparse feature learning for deep belief networks
-
Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) MIT Press, Cambridge
-
Ranzato, M., Boureau, Y.-L., LeCun, Y.: Sparse feature learning for deep belief networks. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems (NIPS 2007), vol. 20, pp. 1185-1192. MIT Press, Cambridge (2008)
-
(2008)
Advances in Neural Information Processing Systems (NIPS 2007)
, vol.20
, pp. 1185-1192
-
-
Ranzato, M.1
Boureau, Y.-L.2
LeCun, Y.3
-
55
-
-
80053460450
-
Contractive auto-encoders: Explicit invariance during feature extraction
-
Rifai, S., Vincent, P., Muller, X., Glorot, X., Bengio, Y.: Contractive auto-encoders: Explicit invariance during feature extraction. In: Proceedings of the Twenty-eight International Conference on Machine Learning, ICML 2011 (2011)
-
Proceedings of the Twenty-eight International Conference on Machine Learning, ICML 2011 (2011)
-
-
Rifai, S.1
Vincent, P.2
Muller, X.3
Glorot, X.4
Bengio, Y.5
-
56
-
-
0022471098
-
Learning representations by backpropagating errors
-
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by backpropagating errors. Nature 323, 533-536 (1986)
-
(1986)
Nature
, vol.323
, pp. 533-536
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Williams, R.J.3
-
59
-
-
80053165824
-
-
Technical Report MIT-CSAIL-TR-2010-037, Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology
-
Salakhutdinov, R., Hinton, G.E.: An efficient learning procedure for deep Boltzmann machines. Technical Report MIT-CSAIL-TR-2010-037, Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (2010)
-
(2010)
An Efficient Learning Procedure for Deep Boltzmann Machines
-
-
Salakhutdinov, R.1
Hinton, G.E.2
-
60
-
-
34547983260
-
Restricted Boltzmann machines for collaborative filtering
-
Ghahramani, Z. (ed.) ACM, New York
-
Salakhutdinov, R., Mnih, A., Hinton, G.E.: Restricted Boltzmann machines for collaborative filtering. In: Ghahramani, Z. (ed.) Proceedings of the Twenty-fourth International Conference on Machine Learning (ICML 2007), pp. 791-798. ACM, New York (2007)
-
(2007)
Proceedings of the Twenty-fourth International Conference on Machine Learning (ICML 2007)
, pp. 791-798
-
-
Salakhutdinov, R.1
Mnih, A.2
Hinton, G.E.3
-
61
-
-
0000329993
-
Information processing in dynamical systems: Foundations of harmony theory
-
Rumelhart, D.E., McClelland, J.L. (eds.) ch. 6, . MIT Press, Cambridge
-
Smolensky, P.: Information processing in dynamical systems: Foundations of harmony theory. In: Rumelhart, D.E., McClelland, J.L. (eds.) Parallel Distributed Processing, vol. 1, ch. 6, pp. 194-281. MIT Press, Cambridge (1986)
-
(1986)
Parallel Distributed Processing
, vol.1
, pp. 194-281
-
-
Smolensky, P.1
-
62
-
-
56449086223
-
Training restricted boltzmann machines using approximations to the likelihood gradient
-
Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) ACM, New York
-
Tieleman, T.: Training restricted boltzmann machines using approximations to the likelihood gradient. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008), pp. 1064-1071. ACM, New York (2008)
-
(2008)
Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008)
, pp. 1064-1071
-
-
Tieleman, T.1
-
63
-
-
71149084943
-
Using fast weights to improve persistent contrastive divergence
-
Bottou, L., Littman, M. (eds.) ACM, New York
-
Tieleman, T., Hinton, G.: Using fast weights to improve persistent contrastive divergence. In: Bottou, L., Littman, M. (eds.) Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML 2009), pp. 1033-1040. ACM, New York (2009)
-
(2009)
Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML 2009)
, pp. 1033-1040
-
-
Tieleman, T.1
Hinton, G.2
-
64
-
-
79959575293
-
A connection between score matching and denoising autoencoders
-
Vincent, P.: A connection between score matching and denoising autoencoders. Neural Computation 23(7), 1661-1674 (2011)
-
(2011)
Neural Computation
, vol.23
, Issue.7
, pp. 1661-1674
-
-
Vincent, P.1
-
65
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) ACM, New York
-
Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.-A.: Extracting and composing robust features with denoising autoencoders. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008), pp. 1096-1103. ACM, New York (2008)
-
(2008)
Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008)
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.-A.4
-
66
-
-
79551480483
-
Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
-
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.-A.: Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research 11, 3371-3408 (2010)
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 3371-3408
-
-
Vincent, P.1
Larochelle, H.2
Lajoie, I.3
Bengio, Y.4
Manzagol, P.-A.5
-
68
-
-
56449119888
-
Deep learning via semi-supervised embedding
-
Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) ACM, New York
-
Weston, J., Ratle, F., Collobert, R.: Deep learning via semi-supervised embedding. In: Cohen, W.W., McCallum, A., Roweis, S.T. (eds.) Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008), pp. 1168-1175. ACM, New York (2008)
-
(2008)
Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008)
, pp. 1168-1175
-
-
Weston, J.1
Ratle, F.2
Collobert, R.3
-
69
-
-
0000459353
-
The lack of a priori distinction between learning algorithms
-
Wolpert, D.H.: The lack of a priori distinction between learning algorithms. Neural Computation 8(7), 1341-1390 (1996)
-
(1996)
Neural Computation
, vol.8
, Issue.7
, pp. 1341-1390
-
-
Wolpert, D.H.1
-
71
-
-
33644756784
-
On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates
-
Younes, L.: On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates. Stochastics and Stochastic Reports 65(3), 177-228 (1999)
-
(1999)
Stochastics and Stochastic Reports
, vol.65
, Issue.3
, pp. 177-228
-
-
Younes, L.1
|