SCOPUS 정보 검색 플랫폼

2010 Information Theory and Applications Workshop, ITA 2010 - Conference Proceedings

Volumn , Issue , 2010, Pages 80-89

A tutorial on stochastic approximation algorithms for training restricted Boltzmann machines and deep belief nets

(4) Swersky, Kevin a Chen, Bo a Marlin, Ben a De Freitas, Nando a

a UNIVERSITY OF BRITISH COLUMBIA (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

BELIEF NETWORKS; CONTRASTIVE DIVERGENCE; DATA SETS; FINE TUNING; OPTIMAL PARAMETER; OPTIMAL RESULTS; PARAMETER CHANGES; RESTRICTED BOLTZMANN MACHINE; STOCHASTIC APPROXIMATION ALGORITHMS; STOCHASTIC APPROXIMATIONS; STOCHASTIC MAXIMUM LIKELIHOOD ALGORITHMS;

APPROXIMATION THEORY; INFORMATION THEORY; MAXIMUM LIKELIHOOD; OPTIMIZATION; STOCHASTIC SYSTEMS;

APPROXIMATION ALGORITHMS;

EID: 77952681438 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ITA.2010.5454138 Document Type: Conference Paper

Times cited : (60)

References (44)

1
- 33746600649
- Reducing the dimensionality of data with neural networks
- G. Hinton and R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.¹ Salakhutdinov, R.²

2
- 34547983260
- Restricted Boltzmann machines for collaborative filtering
- R. Salakhutdinov, A. Mnih, and G. Hinton, "Restricted Boltzmann machines for collaborative filtering," in International Conference on Machine learning, 2007, pp. 791-798.
- International Conference on Machine Learning, 2007 , pp. 791-798
- Salakhutdinov, R.¹ Mnih, A.² Hinton, G.³

3
- 84906549138
- Semantic hashing
- R. Salakhutdinov and G. Hinton, "Semantic hashing," International Journal of Approximate Reasoning, 2008.
- (2008) International Journal of Approximate Reasoning
- Salakhutdinov, R.¹ Hinton, G.²

4
- 84864073449
- Greedy layer-wise training of deep networks
- MIT Press
- Y. Bengio, P. Lamblin, D. Popovici, H. Larochelle, and U. Montreal, "Greedy layer-wise training of deep networks," in Advances in Neural Information Processing Systems. MIT Press, 2007.
- (2007) Advances in Neural Information Processing Systems
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴ Montreal, U.⁵

5
- 34547975052
- Scaling learning algorithms towards AI
- Y. Bengio and Y. Le Cun, "Scaling learning algorithms towards AI," Large-Scale Kernel Machines, 2007.
- (2007) Large-Scale Kernel Machines
- Bengio, Y.¹ Le Cun, Y.²

6
- 71149119164
- Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
- H. Lee, R. Grosse, R. Ranganath, and A. Ng, "Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations," in International Conference on Machine Learning, 2009.
- International Conference on Machine Learning, 2009
- Lee, H.¹ Grosse, R.² Ranganath, R.³ Ng, A.⁴

7
- 0036546660
- Slow feature analysis: Unsupervised learning of invariances
- L. Wiskott and T. Sejnowski, "Slow feature analysis: Unsupervised learning of invariances," Neural Computation, vol. 14, no. 4, pp. 715-770, 2002.
- (2002) Neural Computation , vol.14 , Issue.4 , pp. 715-770
- Wiskott, L.¹ Sejnowski, T.²

8
- 33847305334
- Numenta, Tech. Rep.
- J. Hawkins and D. George, "Hierarchical temporal memory: Concepts, theory and terminology," Numenta, Tech. Rep., 2006.
- (2006) Hierarchical Temporal Memory: Concepts, Theory and Terminology
- Hawkins, J.¹ George, D.²

9
- 84860644702
- Measuring invariances in deep networks
- I. J. Goodfellow, Q. V. Le, A. M. Saxe, H. Lee, and A. Y. Ng., "Measuring invariances in deep networks," Advances in neural information processing systems, 2009.
- (2009) Advances in Neural Information Processing Systems
- Goodfellow, I.J.¹ Le, Q.V.² Saxe, A.M.³ Lee, H.⁴ Ng, A.Y.⁵

10
- 33745805403
- A fast learning algorithm for deep belief nets
- G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.¹ Osindero, S.² Teh, Y.³

11
- 0000355193
- Parametric inference for imperfectly observed Gibbsian fields
- L. Younes, "Parametric inference for imperfectly observed Gibbsian fields," Probability Theory and Related Fields, vol. 82, no. 4, pp. 625-645, 1989.
- (1989) Probability Theory and Related Fields , vol.82 , Issue.4 , pp. 625-645
- Younes, L.¹

12
- 56449086223
- Training restricted Boltzmann machines using approximations to the likelihood gradient
- T. Tieleman, "Training restricted Boltzmann machines using approximations to the likelihood gradient," in International conference on Machine Learning, 2008, pp. 1064-1071.
- International Conference on Machine Learning, 2008 , pp. 1064-1071
- Tieleman, T.¹

13
- 84899000641
- Exponential family harmoniums with an application to information retrieval
- M. Welling, M. Rosen-Zvi, and G. Hinton, "Exponential family harmoniums with an application to information retrieval," Advances in neural information processing systems, vol. 17, pp. 1481-1488, 2005.
- (2005) Advances in Neural Information Processing Systems , vol.17 , pp. 1481-1488
- Welling, M.¹ Rosen-Zvi, M.² Hinton, G.³

14
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. E. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Computation, vol. 14, p. 2002, 2002.
- (2002) Neural Computation , vol.14 , pp. 2002
- Hinton, G.E.¹

15
- 84862612564
- On contrastive divergence learning
- M. Carreira-Perpinan and G. Hinton, "On contrastive divergence learning," in Artificial Intelligence and Statistics, vol. 2005, 2005.
- (2005) Artificial Intelligence and Statistics , vol.2005
- Carreira-Perpinan, M.¹ Hinton, G.²

16
- 77952677885
- B. Marlin, "A direct proof that the true RBM parameters are a fixed point of both ML and CD1 in the asymptotic setting," 2008.
- (2008) A Direct Proof That the True RBM Parameters Are a Fixed Point of Both ML and CD1 in the Asymptotic Setting
- Marlin, B.¹

17
- 48849114847
- The convergence of contrastive divergences
- A. Yuille, "The convergence of contrastive divergences," in Advances in Neural Information Processing Systems, 2004.
- (2004) Advances in Neural Information Processing Systems
- Yuille, A.¹

18
- 0003778897
- Springer-Verlag
- A. Benveniste, M. Métivier, and P. Priouret, Adaptive algorithms and stochastic approximations. Springer-Verlag, 1990.
- (1990) Adaptive Algorithms and Stochastic Approximations
- Benveniste, A.¹ Métivier, M.² Priouret, P.³

19
- 0043224019
- A Newton-Raphson version of the multivariate Robbins-Monro procedure
- D. Ruppert, "A Newton-Raphson version of the multivariate Robbins-Monro procedure," Ann. Statist., vol. 13, no. 1, pp. 236-245, 1985.
- (1985) Ann. Statist. , vol.13 , Issue.1 , pp. 236-245
- Ruppert, D.¹

20
- 0032260190
- Adaptive stochastic approximation by the simultaneous perturbation method
- J. Spall, "Adaptive stochastic approximation by the simultaneous perturbation method," IEEE Conference on Decision and Control, pp. 3872-3879, 1998.
- (1998) IEEE Conference on Decision and Control , pp. 3872-3879
- Spall, J.¹

21
- 0000828406
- A new method of stochastic approximation type
- B. T. Polyak, "A new method of stochastic approximation type," Avtomat. i Telemekh., no. 7, pp. 98-107, 1990.
- (1990) Avtomat. I Telemekh. , Issue.7 , pp. 98-107
- Polyak, B.T.¹

22
- 0026899240
- Acceleration of stochastic approximation by averaging
- B. Polyak and A. Juditsky, "Acceleration of stochastic approximation by averaging," SIAM Journal on Control and Optimization, vol. 30, p. 838, 1992.
- (1992) SIAM Journal on Control and Optimization , vol.30 , pp. 838
- Polyak, B.¹ Juditsky, A.²

23
- 0019608150
- Averaging methods for the asymptotic analysis of learning and adaptive systems, with small adjustment rate
- H. J. Kushner and H. Huang, "Averaging methods for the asymptotic analysis of learning and adaptive systems, with small adjustment rate," SIAM J. Control Optim., vol. 19, no. 5, pp. 635-650, 1981.
- (1981) SIAM J. Control Optim. , vol.19 , Issue.5 , pp. 635-650
- Kushner, H.J.¹ Huang, H.²

24
- 0002824293
- Asymptotic properties of stochastic approximations with constant coefficients
- -, "Asymptotic properties of stochastic approximations with constant coefficients," SIAM J. Control Optim., vol. 19, no. 1, pp. 87-105, 1981.
- (1981) SIAM J. Control Optim. , vol.19 , Issue.1 , pp. 87-105

25
- 0004066022
- Springer-Verlag
- H. J. Kushner and G. G. Yin, Stochastic Approximation Algorithms and Applications. Springer-Verlag, 1997.
- (1997) Stochastic Approximation Algorithms and Applications
- Kushner, H.J.¹ Yin, G.G.²

26
- 0003723679
- MIT Press
- L. Ljung and T. Söderström, Theory and practice of recursive identification. MIT Press, 1983.
- (1983) Theory and Practice of Recursive Identification
- Ljung, L.¹ Söderström, T.²

27
- 58849087743
- Cambridge University Press
- V. Borkar, Stochastic Approximation: A Dynamical Systems Viewpoint. Cambridge University Press, 2008.
- (2008) Stochastic Approximation: A Dynamical Systems Viewpoint
- Borkar, V.¹

28
- 0346881152
- Steepest descent with momentum for quadratic functions is a version of the conjugate gradient method
- A. Bhaya and E. Kaszkurewicz, "Steepest descent with momentum for quadratic functions is a version of the conjugate gradient method," Neural Networks, vol. 17, no. 1, pp. 65-71, 2004.
- (2004) Neural Networks , vol.17 , Issue.1 , pp. 65-71
- Bhaya, A.¹ Kaszkurewicz, E.²

29
- 0032069997
- Analysis of momentum adaptive filtering algorithms
- May
- R. Sharma, W. Sethares, and J. Bucklew, "Analysis of momentum adaptive filtering algorithms," IEEE Transactions on Signal Processing, vol. 46, no. 5, pp. 1430-1434, May 1998.
- (1998) IEEE Transactions on Signal Processing , vol.46 , Issue.5 , pp. 1430-1434
- Sharma, R.¹ Sethares, W.² Bucklew, J.³

30
- 63849327496
- Hopfield network
- J. J. Hopfield, "Hopfield network," Scholarpedia, 2007.
- (2007) Scholarpedia
- Hopfield, J.J.¹

31
- 79959651429
- Herding Dynamic Weights for Partially Observed Random Field Models
- M. Welling, "Herding Dynamic Weights for Partially Observed Random Field Models," in UAI, 2009.
- (2009) UAI
- Welling, M.¹

32
- 77952686979
- Generalization error bounds for aggregation by mirror descent with averaging
- A. Juditsky, A. Nazin, A. Tsybakov, and N. Vayatis, "Generalization error bounds for aggregation by mirror descent with averaging," Advances in neural information processing systems, 2005.
- (2005) Advances in Neural Information Processing Systems
- Juditsky, A.¹ Nazin, A.² Tsybakov, A.³ Vayatis, N.⁴

33
- 0030242092
- General results on the convergence of stochastic algorithms
- B. Delyon, "General results on the convergence of stochastic algorithms," IEEE Transactions on Automatic Control, vol. 41, no. 9, pp. 1245-1255, 1996.
- (1996) IEEE Transactions on Automatic Control , vol.41 , Issue.9 , pp. 1245-1255
- Delyon, B.¹

34
- 57849088168
- A tutorial on adaptive MCMC
- C. Andrieu and J. Thoms, "A tutorial on adaptive MCMC," Statistics and Computing, vol. 18, no. 4, pp. 343-373, 2008.
- (2008) Statistics and Computing , vol.18 , Issue.4 , pp. 343-373
- Andrieu, C.¹ Thoms, J.²

35
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

36
- 6344235947
- NEC Research Institute
- Y. LeCun and C. Cortes, "The MNIST database of handwritten digits," NEC Research Institute, http://yann. lecun. com/exdb/mnist/index. html.
- The MNIST Database of Handwritten Digits
- LeCun, Y.¹ Cortes, C.²

37
- 50549197532
- Some methods of speeding up the convergence of iterative methods
- B. Polyak, "Some methods of speeding up the convergence of iterative methods," USSR Computational Mathematics and Mathematical Physics., vol. 4, pp. 1-17, 1964.
- (1964) USSR Computational Mathematics and Mathematical Physics , vol.4 , pp. 1-17
- Polyak, B.¹

38
- 85162035281
- The tradeoffs of large scale learning
- L. Bottou and O. Bousquet, "The tradeoffs of large scale learning," Advances in neural information processing systems, vol. 20, 2007.
- (2007) Advances in Neural Information Processing Systems , vol.20
- Bottou, L.¹ Bousquet, O.²

39
- 77952726973
- R. Salakhutdinov and G. Hinton, "Training a deep autoencoder or a classifier on MNIST digits - source code," 2006.
- (2006) Training a Deep Autoencoder or a Classifier on MNIST Digits - Source Code
- Salakhutdinov, R.¹ Hinton, G.²

40
- 59449087310
- Exploring Strategies for Training Deep Neural Networks
- H. Larochelle, Y. Bengio, J. Louradour, and P. Lamblin, "Exploring Strategies for Training Deep Neural Networks," Journal of Machine Learning Research, vol. 1, pp. 1-40, 2009.
- (2009) Journal of Machine Learning Research , vol.1 , pp. 1-40
- Larochelle, H.¹ Bengio, Y.² Louradour, J.³ Lamblin, P.⁴

41
- 85161980001
- Sparse deep belief net model for visual area V2
- H. Lee, C. Ekanadham, and A. Ng, "Sparse deep belief net model for visual area V2," Advances in neural information processing systems, vol. 20, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20
- Lee, H.¹ Ekanadham, C.² Ng, A.³

42
- 85162008868
- Learning horizontal connections in a sparse coding model of natural images
- P. Garrigues and B. Olshausen, "Learning horizontal connections in a sparse coding model of natural images," Advances in Neural Information Processing Systems, vol. 20, pp. 505-512, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 505-512
- Garrigues, P.¹ Olshausen, B.²

43
- 56449110012
- Classification using discriminative restricted Boltzmann machines
- H. Larochelle and Y. Bengio, "Classification using discriminative restricted Boltzmann machines," in International Conference on Machine learning, 2008, pp. 536-543.
- International Conference on Machine Learning, 2008 , pp. 536-543
- Larochelle, H.¹ Bengio, Y.²

44
- 73249147663
- The difficulty of training deep architectures and the effect of unsupervised pre-training
- D. Erhan, P. Manzagol, Y. Bengio, S. Bengio, and P. Vincent, "The difficulty of training deep architectures and the effect of unsupervised pre-training," AISTATS, 2009.
- (2009) AISTATS
- Erhan, D.¹ Manzagol, P.² Bengio, Y.³ Bengio, S.⁴ Vincent, P.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.