-
1
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
G. Hinton and R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.1
Salakhutdinov, R.2
-
2
-
-
34547983260
-
Restricted Boltzmann machines for collaborative filtering
-
R. Salakhutdinov, A. Mnih, and G. Hinton, "Restricted Boltzmann machines for collaborative filtering," in International Conference on Machine Learning, 2007, pp. 791-798.
-
International Conference on Machine Learning, 2007
, pp. 791-798
-
-
Salakhutdinov, R.1
Mnih, A.2
Hinton, G.3
-
4
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
MIT Press
-
Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, "Greedy layer-wise training of deep networks," in Advances in Neural Information Processing Systems. MIT Press, 2007.
-
(2007)
Advances in Neural Information Processing Systems
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
6
-
-
71149119164
-
Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
-
H. Lee, R. Grosse, R. Ranganath, and A. Ng, "Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations," in International Conference on Machine Learning, 2009.
-
International Conference on Machine Learning, 2009
-
-
Lee, H.1
Grosse, R.2
Ranganath, R.3
Ng, A.4
-
7
-
-
0036546660
-
Slow feature analysis: Unsupervised learning of invariances
-
L. Wiskott and T. Sejnowski, "Slow feature analysis: Unsupervised learning of invariances," Neural Computation, vol. 14, no. 4, pp. 715-770, 2002.
-
(2002)
Neural Computation
, vol.14
, Issue.4
, pp. 715-770
-
-
Wiskott, L.1
Sejnowski, T.2
-
9
-
-
84860644702
-
Measuring invariances in deep networks
-
I. J. Goodfellow, Q. V. Le, A. M. Saxe, H. Lee, and A. Y. Ng, "Measuring invariances in deep networks," Advances in Neural Information Processing Systems, 2009.
-
(2009)
Advances in Neural Information Processing Systems
-
-
Goodfellow, I.J.1
Le, Q.V.2
Saxe, A.M.3
Lee, H.4
Ng, A.Y.5
-
10
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.1
Osindero, S.2
Teh, Y.3
-
11
-
-
0000355193
-
Parametric inference for imperfectly observed Gibbsian fields
-
L. Younes, "Parametric inference for imperfectly observed Gibbsian fields," Probability Theory and Related Fields, vol. 82, no. 4, pp. 625-645, 1989.
-
(1989)
Probability Theory and Related Fields
, vol.82
, Issue.4
, pp. 625-645
-
-
Younes, L.1
-
12
-
-
56449086223
-
Training restricted Boltzmann machines using approximations to the likelihood gradient
-
T. Tieleman, "Training restricted Boltzmann machines using approximations to the likelihood gradient," in International Conference on Machine Learning, 2008, pp. 1064-1071.
-
International Conference on Machine Learning, 2008
, pp. 1064-1071
-
-
Tieleman, T.1
-
13
-
-
84899000641
-
Exponential family harmoniums with an application to information retrieval
-
M. Welling, M. Rosen-Zvi, and G. Hinton, "Exponential family harmoniums with an application to information retrieval," Advances in Neural Information Processing Systems, vol. 17, pp. 1481-1488, 2005.
-
(2005)
Advances in Neural Information Processing Systems
, vol.17
, pp. 1481-1488
-
-
Welling, M.1
Rosen-Zvi, M.2
Hinton, G.3
-
14
-
-
0013344078
-
Training products of experts by minimizing contrastive divergence
-
G. E. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Computation, vol. 14, no. 8, pp. 1771-1800, 2002.
-
(2002)
Neural Computation
, vol.14
, Issue.8
, pp. 1771-1800
-
-
Hinton, G.E.1
-
19
-
-
0043224019
-
A Newton-Raphson version of the multivariate Robbins-Monro procedure
-
D. Ruppert, "A Newton-Raphson version of the multivariate Robbins-Monro procedure," Ann. Statist., vol. 13, no. 1, pp. 236-245, 1985.
-
(1985)
Ann. Statist.
, vol.13
, Issue.1
, pp. 236-245
-
-
Ruppert, D.1
-
20
-
-
0032260190
-
Adaptive stochastic approximation by the simultaneous perturbation method
-
J. Spall, "Adaptive stochastic approximation by the simultaneous perturbation method," IEEE Conference on Decision and Control, pp. 3872-3879, 1998.
-
(1998)
IEEE Conference on Decision and Control
, pp. 3872-3879
-
-
Spall, J.1
-
21
-
-
0000828406
-
A new method of stochastic approximation type
-
B. T. Polyak, "A new method of stochastic approximation type," Avtomat. i Telemekh., no. 7, pp. 98-107, 1990.
-
(1990)
Avtomat. i Telemekh.
, Issue.7
, pp. 98-107
-
-
Polyak, B.T.1
-
23
-
-
0019608150
-
Averaging methods for the asymptotic analysis of learning and adaptive systems, with small adjustment rate
-
H. J. Kushner and H. Huang, "Averaging methods for the asymptotic analysis of learning and adaptive systems, with small adjustment rate," SIAM J. Control Optim., vol. 19, no. 5, pp. 635-650, 1981.
-
(1981)
SIAM J. Control Optim.
, vol.19
, Issue.5
, pp. 635-650
-
-
Kushner, H.J.1
Huang, H.2
-
24
-
-
0002824293
-
Asymptotic properties of stochastic approximations with constant coefficients
-
H. J. Kushner and H. Huang, "Asymptotic properties of stochastic approximations with constant coefficients," SIAM J. Control Optim., vol. 19, no. 1, pp. 87-105, 1981.
-
(1981)
SIAM J. Control Optim.
, vol.19
, Issue.1
, pp. 87-105
-
-
Kushner, H.J.1
Huang, H.2
-
28
-
-
0346881152
-
Steepest descent with momentum for quadratic functions is a version of the conjugate gradient method
-
A. Bhaya and E. Kaszkurewicz, "Steepest descent with momentum for quadratic functions is a version of the conjugate gradient method," Neural Networks, vol. 17, no. 1, pp. 65-71, 2004.
-
(2004)
Neural Networks
, vol.17
, Issue.1
, pp. 65-71
-
-
Bhaya, A.1
Kaszkurewicz, E.2
-
29
-
-
0032069997
-
Analysis of momentum adaptive filtering algorithms
-
May
-
R. Sharma, W. Sethares, and J. Bucklew, "Analysis of momentum adaptive filtering algorithms," IEEE Transactions on Signal Processing, vol. 46, no. 5, pp. 1430-1434, May 1998.
-
(1998)
IEEE Transactions on Signal Processing
, vol.46
, Issue.5
, pp. 1430-1434
-
-
Sharma, R.1
Sethares, W.2
Bucklew, J.3
-
31
-
-
79959651429
-
Herding Dynamic Weights for Partially Observed Random Field Models
-
M. Welling, "Herding Dynamic Weights for Partially Observed Random Field Models," in UAI, 2009.
-
(2009)
UAI
-
-
Welling, M.1
-
32
-
-
77952686979
-
Generalization error bounds for aggregation by mirror descent with averaging
-
A. Juditsky, A. Nazin, A. Tsybakov, and N. Vayatis, "Generalization error bounds for aggregation by mirror descent with averaging," Advances in Neural Information Processing Systems, 2005.
-
(2005)
Advances in Neural Information Processing Systems
-
-
Juditsky, A.1
Nazin, A.2
Tsybakov, A.3
Vayatis, N.4
-
33
-
-
0030242092
-
General results on the convergence of stochastic algorithms
-
B. Delyon, "General results on the convergence of stochastic algorithms," IEEE Transactions on Automatic Control, vol. 41, no. 9, pp. 1245-1255, 1996.
-
(1996)
IEEE Transactions on Automatic Control
, vol.41
, Issue.9
, pp. 1245-1255
-
-
Delyon, B.1
-
34
-
-
57849088168
-
A tutorial on adaptive MCMC
-
C. Andrieu and J. Thoms, "A tutorial on adaptive MCMC," Statistics and Computing, vol. 18, no. 4, pp. 343-373, 2008.
-
(2008)
Statistics and Computing
, vol.18
, Issue.4
, pp. 343-373
-
-
Andrieu, C.1
Thoms, J.2
-
37
-
-
50549197532
-
Some methods of speeding up the convergence of iterative methods
-
B. Polyak, "Some methods of speeding up the convergence of iterative methods," USSR Computational Mathematics and Mathematical Physics, vol. 4, pp. 1-17, 1964.
-
(1964)
USSR Computational Mathematics and Mathematical Physics
, vol.4
, pp. 1-17
-
-
Polyak, B.1
-
40
-
-
59449087310
-
Exploring Strategies for Training Deep Neural Networks
-
H. Larochelle, Y. Bengio, J. Louradour, and P. Lamblin, "Exploring Strategies for Training Deep Neural Networks," Journal of Machine Learning Research, vol. 10, pp. 1-40, 2009.
-
(2009)
Journal of Machine Learning Research
, vol.10
, pp. 1-40
-
-
Larochelle, H.1
Bengio, Y.2
Louradour, J.3
Lamblin, P.4
-
41
-
-
85161980001
-
Sparse deep belief net model for visual area V2
-
H. Lee, C. Ekanadham, and A. Ng, "Sparse deep belief net model for visual area V2," Advances in Neural Information Processing Systems, vol. 20, 2008.
-
(2008)
Advances in Neural Information Processing Systems
, vol.20
-
-
Lee, H.1
Ekanadham, C.2
Ng, A.3
-
44
-
-
73249147663
-
The difficulty of training deep architectures and the effect of unsupervised pre-training
-
D. Erhan, P. Manzagol, Y. Bengio, S. Bengio, and P. Vincent, "The difficulty of training deep architectures and the effect of unsupervised pre-training," AISTATS, 2009.
-
(2009)
AISTATS
-
-
Erhan, D.1
Manzagol, P.2
Bengio, Y.3
Bengio, S.4
Vincent, P.5