SCOPUS 정보 검색 플랫폼

Journal of Machine Learning Research

Volumn 15, Issue , 2014, Pages 2949-2980

Multimodal learning with Deep Boltzmann Machines

(2) Srivastava, Nitish a Salakhutdinov, Ruslan a

a UNIVERSITY OF TORONTO (Canada)

Author keywords

Boltzmann machines; Deep learning; Multimodal learning; Neural networks; Unsupervised learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; NEURAL NETWORKS; UNSUPERVISED LEARNING;

BOLTZMANN MACHINES; CLASSIFICATION RESULTS; CONDITIONAL DISTRIBUTION; DEEP BOLTZMANN MACHINES; DEEP LEARNING; MULTI-MODAL LEARNING; MULTIPLE KERNEL LEARNING; STATISTICAL PROPERTIES;

CLASSIFICATION (OF INFORMATION);

EID: 84916911784 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (406)

References (41)

1
- 85008039046
- Bilvideo-7: An MPEG-7- compatible video indexing and retrieval system
- M. Bastan, H. Cam, U. Gudukbay, and O. Ulusoy. Bilvideo-7: An MPEG-7- compatible video indexing and retrieval system. IEEE Multimedia, 17:62-73, 2010. ISSN 1070-986X.
- (2010) IEEE Multimedia , vol.17 , pp. 62-73
- Bastan, M.¹ Cam, H.² Gudukbay, U.³ Ulusoy, O.⁴

2
- 50649101132
- Image classification using random forests and ferns
- A. Bosch, A. Zisserman, and X. Munoz. Image classification using random forests and ferns. IEEE 11th International Conference on Computer Vision (2007), 23:1-8, 2007.
- (2007) IEEE 11th International Conference on Computer Vision (2007) , vol.23 , pp. 1-8
- Bosch, A.¹ Zisserman, A.² Munoz, X.³

3
- 85133487154
- The challenge of multispeaker lip-reading
- September
- S. Cox, R. Harvey, Y. Lan, J. Newman, and B. Theobald. The challenge of multispeaker lip-reading. In International Conference on Auditory-Visual Speech Processing, pages 179-184, September 2008.
- (2008) International Conference on Auditory-Visual Speech Processing , pp. 179-184
- Cox, S.¹ Harvey, R.² Lan, Y.³ Newman, J.⁴ Theobald, B.⁵

4
- 33645146449
- Histograms of oriented gradients for human detection
- N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In IEEE Conference on Computer Vision and Pattern Recognition, pages 886-893, 2005.
- (2005) IEEE Conference on Computer Vision and Pattern Recognition , pp. 886-893
- Dalal, N.¹ Triggs, B.²

5
- 0002538142
- The DARPA speech recognition research database: Specifications and status
- W. M. Fisher, G. R. Doddington, and K. M. Goudie-Marshall. The DARPA speech recognition research database: Specifications and status. In Proceedings of DARPA Workshop on Speech Recognition, pages 93-99, 1986.
- (1986) Proceedings of DARPA Workshop on Speech Recognition , pp. 93-99
- Fisher, W.M.¹ Doddington, G.R.² Goudie-Marshall, K.M.³

6
- 56449085852
- Technical report, University of California at Santa Cruz, Santa Cruz, CA, USA
- Y. Freund and D. Haussler. Unsupervised learning of distributions on binary vectors using two layer networks. Technical report, University of California at Santa Cruz, Santa Cruz, CA, USA, 1994.
- (1994) Unsupervised Learning of Distributions on Binary Vectors Using Two Layer Networks
- Freund, Y.¹ Haussler, D.²

7
- 77956006653
- Multimodal semi-supervised learning for image classification
- M. Guillaumin, J. Verbeek, and C. Schmid. Multimodal semi-supervised learning for image classification. In IEEE Conference on Computer Vision and Pattern Recognition, pages 902-909, 2010.
- (2010) IEEE Conference on Computer Vision and Pattern Recognition , pp. 902-909
- Guillaumin, M.¹ Verbeek, J.² Schmid, C.³

8
- 70450273199
- Information theoretic feature extraction for audio-visual speech recognition
- M. Gurban and J.-P. Thiran. Information theoretic feature extraction for audio-visual speech recognition. IEEE Transactions on Signal Processing, 57(12):4765-4776, 2009.
- (2009) IEEE Transactions on Signal Processing , vol.57 , Issue.12 , pp. 4765-4776
- Gurban, M.¹ Thiran, J.-P.²

9
- 33746600649
- Reducing the dimensionality of data with neural networks
- G. Hinton and R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504-507, 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.¹ Salakhutdinov, R.²

10
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. E. Hinton. Training products of experts by minimizing contrastive divergence. Neural Computation, 14(8):1711-1800, 2002.
- (2002) Neural Computation , vol.14 , Issue.8 , pp. 1711-1800
- Hinton, G.E.¹

11
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y. W. Teh. A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527-1554, 2006.
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.W.³

12
- 84890466217
- Improving neural networks by preventing co-adaptation of feature detectors
- abs/1207.0580
- G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. CoRR, abs/1207.0580, 2012.
- (2012) CoRR
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

13
- 70449621223
- The MIR Flickr retrieval evaluation
- M. J. Huiskes and M. S. Lew. The MIR Flickr retrieval evaluation. In ACM International Conference on Multimedia Information Retrieval, 2008.
- ACM International Conference on Multimedia Information Retrieval, 2008
- Huiskes, M.J.¹ Lew, M.S.²

14
- 77952328425
- New trends and ideas in visual concept detection: The MIR Flickr retrieval evaluation initiative
- M. J. Huiskes, B. Thomee, and M. S. Lew. New trends and ideas in visual concept detection: the MIR Flickr retrieval evaluation initiative. In 11th ACM International Conference on Multimedia Information Retrieval, pages 527-536, 2010.
- (2010) 11th ACM International Conference on Multimedia Information Retrieval , pp. 527-536
- Huiskes, M.J.¹ Thomee, B.² Lew, M.S.³

15
- 71149119164
- Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
- H. Lee, R. Grosse, R. Ranganath, and A. Y. Ng. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proceedings of the 26th International Conference on Machine Learning, pages 609-616, 2009.
- (2009) Proceedings of the 26th International Conference on Machine Learning , pp. 609-616
- Lee, H.¹ Grosse, R.² Ranganath, R.³ Ng, A.Y.⁴

16
- 84890504350
- Patch-based representation of visual speech
- VisHCI '06, Australian Computer Society, Inc.
- P. Lucey and S. Sridharan. Patch-based representation of visual speech. In Proceedings of the HCSNet workshop on Use of vision in human-computer interaction - Volume 56, VisHCI '06, pages 79-85. Australian Computer Society, Inc., 2006.
- (2006) Proceedings of the HCSNet Workshop on Use of Vision in Human-computer Interaction , vol.56 , pp. 79-85
- Lucey, P.¹ Sridharan, S.²

17
- 0035365392
- Color and texture descriptors
- B. Manjunath, J.-R. Ohm, V. Vasudevan, and A. Yamada. Color and texture descriptors. Circuits and Systems for Video Technology, IEEE Transactions on, 11(6):703-715, 2001.
- (2001) Circuits and Systems for Video Technology, IEEE Transactions on , vol.11 , Issue.6 , pp. 703-715
- Manjunath, B.¹ Ohm, J.-R.² Vasudevan, V.³ Yamada, A.⁴

18
- 0036472941
- Extraction of visual features for lipreading
- Feb.
- I. Matthews, T. F. Cootes, J. A. Bangham, S. Cox, and R. Harvey. Extraction of visual features for lipreading. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2):198-213, Feb. 2002.
- (2002) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.24 , Issue.2 , pp. 198-213
- Matthews, I.¹ Cootes, T.F.² Bangham, J.A.³ Cox, S.⁴ Harvey, R.⁵

19
- 80051624332
- Acoustic modeling using deep belief networks
- A. Mohamed, G. Dahl, and G. Hinton. Acoustic modeling using deep belief networks. IEEE Transactions on Audio, Speech, and Language Processing, 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

20
- 80053437179
- Multimodal deep learning
- J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning, 2011.
- Proceedings of the 28th International Conference on Machine Learning, 2011
- Ngiam, J.¹ Khosla, A.² Kim, M.³ Nam, J.⁴ Lee, H.⁵ Ng, A.Y.⁶

21
- 0035328421
- Modeling the shape of the scene: A holistic representation of the spatial envelope
- A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42:145-175, 2001.
- (2001) International Journal of Computer Vision , vol.42 , pp. 145-175
- Oliva, A.¹ Torralba, A.²

22
- 0029938380
- Emergence of simple-cell receptive field properties by learning a sparse code for natural images
- B. A. Olshausen and D. J. Field. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature, 381(6583):607-609, 1996.
- (1996) Nature , vol.381 , Issue.6583 , pp. 607-609
- Olshausen, B.A.¹ Field, D.J.²

23
- 48149100910
- Multimodal fusion and learning with uncertain features applied to audiovisual speech recognition
- G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos. Multimodal fusion and learning with uncertain features applied to audiovisual speech recognition. In IEEE 9th Workshop on Multimedia Signal Processing, pages 264-267, 2007.
- (2007) IEEE 9th Workshop on Multimedia Signal Processing , pp. 264-267
- Papandreou, G.¹ Katsamanis, A.² Pitsikalis, V.³ Maragos, P.⁴

24
- 69849103259
- Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition
- G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos. Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(3):423-435, 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.3 , pp. 423-435
- Papandreou, G.¹ Katsamanis, A.² Pitsikalis, V.³ Maragos, P.⁴

25
- 0036299249
- Cuave: A new audio-visual database for multimodal human-computer interface research
- E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy. Cuave: A new audio-visual database for multimodal human-computer interface research. In International Conference on Audio Speech and Signal Processing, pages 2017-2020, 2002.
- (2002) International Conference on Audio Speech and Signal Processing , pp. 2017-2020
- Patterson, E.K.¹ Gurbuz, S.² Tufekci, Z.³ Gowdy, J.N.⁴

26
- 0000016172
- A stochastic approximation method
- H. Robbins and S. Monro. A stochastic approximation method. Ann. Math. Stat., 22: 400-407, 1951.
- (1951) Ann. Math. Stat. , vol.22 , pp. 400-407
- Robbins, H.¹ Monro, S.²

27
- 77956556686
- Replicated softmax: An undirected topic model
- R. Salakhutdinov and G. E. Hinton. Replicated softmax: an undirected topic model. In Advances in Neural Information Processing Systems 22, pages 1607-1614, 2009a.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1607-1614
- Salakhutdinov, R.¹ Hinton, G.E.²

28
- 73249147662
- Deep Boltzmann machines
- R. R. Salakhutdinov and G. E. Hinton. Deep Boltzmann machines. In Proceedings of the International Conference on Artificial Intelligence and Statistics, volume 12, 2009b.
- (2009) Proceedings of the International Conference on Artificial Intelligence and Statistics , vol.12
- Salakhutdinov, R.R.¹ Hinton, G.E.²

29
- 0000329993
- Information processing in dynamical systems: Foundations of harmony theory
- MIT Press, Cambridge, MA, USA
- P. Smolensky. Information processing in dynamical systems: Foundations of harmony theory. In Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Vol. 1, pages 194-281. MIT Press, Cambridge, MA, USA, 1986.
- (1986) Parallel Distributed Processing: Explorations in the Microstructure of Cognition , vol.1 , pp. 194-281
- Smolensky, P.¹

30
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15:1929-1958, 2014.
- (2014) Journal of Machine Learning Research , vol.15 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

31
- 84867652321
- Convolutional learning of spatio-temporal features
- G. W. Taylor, R. Fergus, Y. LeCun, and C. Bregler. Convolutional learning of spatio-temporal features. In Eurpean Conference on Computer Vision, 2010.
- Eurpean Conference on Computer Vision, 2010
- Taylor, G.W.¹ Fergus, R.² LeCun, Y.³ Bregler, C.⁴

32
- 56449086223
- Training Restricted Boltzmann Machines using Approximations to the Likelihood Gradient
- T. Tieleman. Training Restricted Boltzmann Machines using Approximations to the Likelihood Gradient. In Proceedings of the 25th International Conference on Machine Learning, pages 1064-1071, 2008.
- (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 1064-1071
- Tieleman, T.¹

33
- 70349362313
- A. Vedaldi and B. Fulkerson. VLFeat: An open and portable library of computer vision algorithms, 2008.
- (2008) VLFeat: An Open and Portable Library of Computer Vision Algorithms
- Vedaldi, A.¹ Fulkerson, B.²

34
- 77952381653
- Image annotation with TagProp on the MIRFLICKR set
- J. Verbeek, M. Guillaumin, T. Mensink, and C. Schmid. Image annotation with TagProp on the MIRFLICKR set. In 11th ACM International Conference on Multimedia Information Retrieval, pages 537-546, 2010.
- (2010) 11th ACM International Conference on Multimedia Information Retrieval , pp. 537-546
- Verbeek, J.¹ Guillaumin, M.² Mensink, T.³ Schmid, C.⁴

35
- 56449089103
- Extracting and composing robust features with denoising autoencoders
- P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol. Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th International Conference on Machine Learning, pages 1096-1103, 2008.
- (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 1096-1103
- Vincent, P.¹ Larochelle, H.² Bengio, Y.³ Manzagol, P.-A.⁴

36
- 84899000641
- Exponential family harmoniums with an application to information retrieval
- M. Welling, M. Rosen-Zvi, and G. E. Hinton. Exponential family harmoniums with an application to information retrieval. In Advances in Neural Information Processing Systems 18, pages 1481-1488, 2005.
- (2005) Advances in Neural Information Processing Systems , vol.18 , pp. 1481-1488
- Welling, M.¹ Rosen-Zvi, M.² Hinton, G.E.³

37
- 67449114623
- Mining associated text and images with dualwing harmoniums
- AUAI Press
- E. P. Xing, R. Yan, and A. G. Hauptmann. Mining associated text and images with dualwing harmoniums. In Uncertainty in Artificial Intelligence (UAI), pages 633-641. AUAI Press, 2005.
- (2005) Uncertainty in Artificial Intelligence (UAI) , pp. 633-641
- Xing, E.P.¹ Yan, R.² Hauptmann, A.G.³

38
- 0000355193
- Parameter inference for imperfectly observed Gibbsian fields
- L. Younes. Parameter inference for imperfectly observed Gibbsian fields. Probability Theory Rel. Fields, 82:625-645, 1989.
- (1989) Probability Theory Rel. Fields , vol.82 , pp. 625-645
- Younes, L.¹

39
- 33644756784
- On the convergence of Markovian stochastic algorithms with rapidly decreasing ergodicity rates
- L. Younes. On the convergence of Markovian stochastic algorithms with rapidly decreasing ergodicity rates. In Stochastics and Stochastics Models, pages 177-228, 1998.
- (1998) Stochastics and Stochastics Models , pp. 177-228
- Younes, L.¹

40
- 48849114847
- The convergence of contrastive divergences
- A. L. Yuille. The convergence of contrastive divergences. In Advances in Neural Information Processing Systems 17, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.17
- Yuille, A.L.¹

41
- 70350278777
- Lipreading with local spatiotemporal descriptors
- G. Zhao, M. Barnard, and M. Pietikainen. Lipreading with local spatiotemporal descriptors. IEEE Transactions on Multimedia, 11(7):1254-1265, 2009.
- (2009) IEEE Transactions on Multimedia , vol.11 , Issue.7 , pp. 1254-1265
- Zhao, G.¹ Barnard, M.² Pietikainen, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.