SCOPUS Information Search Platform

Neural Computation

Volume 21, Issue 6, 2009, Pages 1601-1621

Justifying and generalizing contrastive divergence

(2) Bengio, Yoshua ᵃ; Delalleau, Olivier ᵃ

ᵃ Université de Montréal (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; EPIDEMIOLOGY; HUMAN; LEARNING; PHYSIOLOGY; STATISTICAL MODEL;

BIAS (EPIDEMIOLOGY); HUMANS; LEARNING; LIKELIHOOD FUNCTIONS; MODELS, STATISTICAL;

EID: 67651049775 PISSN: 08997667 EISSN: 1530888X Source Type: Journal
DOI: 10.1162/neco.2008.11-07-647 Document Type: Letter

Times cited: 191

References (26)

1
- 84864073449
- Greedy layer-wise training of deep networks
- B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Cambridge, MA: MIT Press
- Bengio, Y., Lamblin, P., Popovici, D., & Larochelle, H. (2007). Greedy layer-wise training of deep networks. In B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Advances in neural information processing systems, 19 (pp. 153-160). Cambridge, MA: MIT Press.
- (2007) Advances in neural information processing systems , vol.19 , pp. 153-160
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

2
- 34547975052
- Scaling learning algorithms towards AI
- L. Bottou, O. Chapelle, D. DeCoste, & J. Weston (Eds.), Cambridge, MA: MIT Press
- Bengio, Y., & Le Cun, Y. (2007). Scaling learning algorithms towards AI. In L. Bottou, O. Chapelle, D. DeCoste, & J. Weston (Eds.), Large scale kernel machines. Cambridge, MA: MIT Press.
- (2007) Large scale kernel machines
- Bengio, Y.¹ Le Cun, Y.²

3
- 0024220237
- Auto-association by multilayer perceptrons and singular value decomposition
- Bourlard, H., & Kamp, Y. (1988). Auto-association by multilayer perceptrons and singular value decomposition. Biological Cybernetics, 59, 291-294.
- (1988) Biological Cybernetics , vol.59 , pp. 291-294
- Bourlard, H.¹ Kamp, Y.²

4
- 84862612564
- On contrastive divergence learning
- R. G. Cowell, & Z. Ghahramani (Eds.), N. P.: Society for Artificial Intelligence and Statistics
- Carreira-Perpiñan, M. A., & Hinton, G. E. (2005). On contrastive divergence learning. In R. G. Cowell, & Z. Ghahramani (Eds.), Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics (pp. 33-40). N. P.: Society for Artificial Intelligence and Statistics.
- (2005) Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics , pp. 33-40
- Carreira-Perpiñan, M.A.¹ Hinton, G.E.²

5
- 56449085852
- (Tech. Rep. UCSC-CRL-94-25), Santa Cruz: University of California, Santa Cruz
- Freund, Y., & Haussler, D. (1994). Unsupervised learning of distributions on binary vectors using two layer networks (Tech. Rep. UCSC-CRL-94-25). Santa Cruz: University of California, Santa Cruz.
- (1994) Unsupervised learning of distributions on binary vectors using two layer networks
- Freund, Y.¹ Haussler, D.²

6
- 1642310071
- Basel: Birkhäuser Verlag
- Hernández-Lerma, O., & Lasserre, J. B. (2003). Markov chains and invariant probabilities. Basel: Birkhäuser Verlag.
- (2003) Markov chains and invariant probabilities
- Hernández-Lerma, O.¹ Lasserre, J.B.²

7
- 0033350721
- Products of experts
- New York: IEEE
- Hinton, G. E. (1999). Products of experts. In Proceedings of the Ninth International Conference on Artificial Neural Networks (ICANN) (Vol. 1, pp. 1-6). New York: IEEE.
- (1999) Proceedings of the Ninth International Conference on Artificial Neural Networks (ICANN) , vol.1 , pp. 1-6
- Hinton, G.E.¹

8
- 0013344078
- Training products of experts by minimizing contrastive divergence
- Hinton, G. E. (2002). Training products of experts by minimizing contrastive divergence. Neural Computation, 14, 1771-1800.
- (2002) Neural Computation , vol.14 , pp. 1771-1800
- Hinton, G.E.¹

9
- 33745805403
- A fast learning algorithm for deep belief nets
- Hinton, G. E., Osindero, S., & Teh, Y. (2006). A fast learning algorithm for deep belief nets. Neural Computation, 18, 1527-1554.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.³

10
- 33746600649
- Reducing the dimensionality of data with neural networks
- Hinton, G. E., & Salakhutdinov, R. (2006). Reducing the dimensionality of data with neural networks. Science, 313, 504-507.
- (2006) Science , vol.313 , pp. 504-507
- Hinton, G.E.¹ Salakhutdinov, R.²

11
- 0000999440
- Learning and relearning in Boltzmann machines
- D. E. Rumelhart & J. L. McClelland (Eds.), Foundations. Cambridge, MA: MIT Press
- Hinton, G. E., & Sejnowski, T. J. (1986). Learning and relearning in Boltzmann machines. In D. E. Rumelhart & J. L. McClelland (Eds.), Parallel distributed processing: Explorations in the microstructure of cognition. Vol. 1: Foundations. Cambridge, MA: MIT Press.
- (1986) Parallel distributed processing: Explorations in the microstructure of cognition , vol.1
- Hinton, G.E.¹ Sejnowski, T.J.²

12
- 0004243089
- (Tech. Rep. TR-CMU-CS-84-119), Pittsburgh, PA: Carnegie Mellon University, Department of Computer Science
- Hinton, G. E., Sejnowski, T. J., & Ackley, D. H. (1984). Boltzmann machines: Constraint satisfaction networks that learn (Tech. Rep. TR-CMU-CS-84-119). Pittsburgh, PA: Carnegie Mellon University, Department of Computer Science.
- (1984) Boltzmann machines: Constraint satisfaction networks that learn
- Hinton, G.E.¹ Sejnowski, T.J.² Ackley, D.H.³

13
- 0002834189
- Autoencoders, minimum description length, and Helmholtz free energy
- D. Cowan, G. Tesauro, & J. Alspector (Eds.), San Francisco: Morgan Kaufmann
- Hinton, G. E., & Zemel, R. S. (1994). Autoencoders, minimum description length, and Helmholtz free energy. In D. Cowan, G. Tesauro, & J. Alspector (Eds.), Advances in neural information processing systems, 6 (pp. 3-10). San Francisco: Morgan Kaufmann.
- (1994) Advances in neural information processing systems , vol.6 , pp. 3-10
- Hinton, G.E.¹ Zemel, R.S.²

14
- 0034153465
- Nonlinear autoassociation is not equivalent to PCA
- Japkowicz, N., Hanson, S. J., & Gluck, M. A. (2000). Nonlinear autoassociation is not equivalent to PCA. Neural Computation, 12(3), 531-545.
- (2000) Neural Computation , vol.12 , Issue.3 , pp. 531-545
- Japkowicz, N.¹ Hanson, S.J.² Gluck, M.A.³

15
- 34547967782
- An empirical evaluation of deep architectures on problems with many factors of variation
- Z. Ghahramani (Ed.), Madison, WI: Omnipress
- Larochelle, H., Erhan, D., Courville, A., Bergstra, J., & Bengio, Y. (2007). An empirical evaluation of deep architectures on problems with many factors of variation. In Z. Ghahramani (Ed.), Twenty-Fourth International Conference on Machine Learning (ICML'2007) (pp. 473-480). Madison, WI: Omnipress.
- (2007) Twenty-Fourth International Conference on Machine Learning (ICML'2007) , pp. 473-480
- Larochelle, H.¹ Erhan, D.² Courville, A.³ Bergstra, J.⁴ Bengio, Y.⁵

16
- 51349119518
- Unpublished manuscript
- MacKay, D. (2001). Failures of the one-step learning algorithm. Unpublished manuscript.
- (2001) Failures of the one-step learning algorithm
- MacKay, D.¹

17
- 84864069017
- Efficient learning of sparse representations with an energy-based model
- B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Cambridge, MA: MIT Press
- Ranzato, M., Poultney, C., Chopra, S., & Le Cun, Y. (2007). Efficient learning of sparse representations with an energy-based model. In B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Advances in neural information processing systems, 19. Cambridge, MA: MIT Press.
- (2007) Advances in neural information processing systems , vol.19
- Ranzato, M.¹ Poultney, C.² Chopra, S.³ Le Cun, Y.⁴

18
- 0022471098
- Learning representations by back-propagating errors
- Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning representations by back-propagating errors. Nature, 323, 533-536.
- (1986) Nature , vol.323 , pp. 533-536
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

19
- 70350693506
- Semantic hashing
- A. McCallum & S. Roweis (Eds.), Amsterdam: Elsevier
- Salakhutdinov, R., & Hinton, G. (2007). Semantic hashing. In A. McCallum & S. Roweis (Eds.), Proceedings of the 2007 Workshop on Information Retrieval and Applications of Graphical Models (SIGIR 2007). Amsterdam: Elsevier.
- (2007) Proceedings of the 2007 Workshop on Information Retrieval and Applications of Graphical Models (SIGIR 2007)
- Salakhutdinov, R.¹ Hinton, G.²

20
- 72249121023
- Markov chains and Monte-Carlo simulation
- Ulm: Ulm University, Department of Stochastics
- Schmidt, V. (2006). Markov chains and Monte-Carlo simulation. In Lecture Notes, Summer 2006. Ulm: Ulm University, Department of Stochastics.
- (2006) Lecture Notes, Summer 2006
- Schmidt, V.¹

21
- 85153961469
- Transformation invariant autoassociation with application to handwritten character recognition
- G. Tesauro, D. Touretzky, & T. Leen (Eds.), Cambridge, MA: MIT Press
- Schwenk, H., & Milgram, M. (1995). Transformation invariant autoassociation with application to handwritten character recognition. In G. Tesauro, D. Touretzky, & T. Leen (Eds.), Advances in neural information processing systems, 7 (pp. 991-998). Cambridge, MA: MIT Press.
- (1995) Advances in neural information processing systems , vol.7 , pp. 991-998
- Schwenk, H.¹ Milgram, M.²

22
- 0000329993
- Information processing in dynamical systems: Foundations of harmony theory
- D. E. Rumelhart & J. L. McClelland (Eds.), Cambridge, MA: MIT Press
- Smolensky, P. (1986). Information processing in dynamical systems: Foundations of harmony theory. In D. E. Rumelhart & J. L. McClelland (Eds.), Parallel distributed processing (Vol. 1, pp. 194-281). Cambridge, MA: MIT Press.
- (1986) Parallel distributed processing , vol.1 , pp. 194-281
- Smolensky, P.¹

23
- 84864026688
- Modeling human motion using binary latent variables
- B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Cambridge, MA: MIT Press
- Taylor, G., Hinton, G., & Roweis, S. (2007). Modeling human motion using binary latent variables. In B. Schölkopf, J. Platt, & T. Hoffman (Eds.), Advances in neural information processing systems, 19 (pp. 1345-1352). Cambridge, MA: MIT Press.
- (2007) Advances in neural information processing systems , vol.19 , pp. 1345-1352
- Taylor, G.¹ Hinton, G.² Roweis, S.³

24
- 56449086223
- Training restricted Boltzmann machines using approximations to the likelihood gradient
- A. McCallum & S. Roweis (Eds.), Madison, WI: Omnipress
- Tieleman, T. (2008). Training restricted Boltzmann machines using approximations to the likelihood gradient. In A. McCallum & S. Roweis (Eds.), Proceedings of the International Conference on Machine Learning (Vol. 25, pp. 1064-1071). Madison, WI: Omnipress.
- (2008) Proceedings of the International Conference on Machine Learning , vol.25 , pp. 1064-1071
- Tieleman, T.¹

25
- 84899000641
- Exponential family harmoniums with an application to information retrieval
- L. Saul, Y. Weiss, & L. Bottou (Eds.), Cambridge, MA: MIT Press
- Welling, M., Rosen-Zvi, M., & Hinton, G. E. (2005). Exponential family harmoniums with an application to information retrieval. In L. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17. Cambridge, MA: MIT Press.
- (2005) Advances in neural information processing systems , vol.17
- Welling, M.¹ Rosen-Zvi, M.² Hinton, G.E.³

26
- 84899029362
- The convergence of contrastive divergences
- L. Saul, Y. Weiss, & L. Bottou (Eds.), Cambridge, MA: MIT Press
- Yuille, A. L. (2005). The convergence of contrastive divergences. In L. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17 (pp. 1593-1600). Cambridge, MA: MIT Press.
- (2005) Advances in neural information processing systems , vol.17 , pp. 1593-1600
- Yuille, A.L.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.