-
1
-
-
85008039046
-
Bilvideo-7: An MPEG-7- compatible video indexing and retrieval system
-
M. Bastan, H. Cam, U. Gudukbay, and O. Ulusoy. Bilvideo-7: An MPEG-7- compatible video indexing and retrieval system. IEEE Multimedia, 17:62-73, 2010. ISSN 1070-986X.
-
(2010)
IEEE Multimedia
, vol.17
, pp. 62-73
-
-
Bastan, M.1
Cam, H.2
Gudukbay, U.3
Ulusoy, O.4
-
3
-
-
85133487154
-
The challenge of multispeaker lip-reading
-
September
-
S. Cox, R. Harvey, Y. Lan, J. Newman, and B. Theobald. The challenge of multispeaker lip-reading. In International Conference on Auditory-Visual Speech Processing, pages 179-184, September 2008.
-
(2008)
International Conference on Auditory-Visual Speech Processing
, pp. 179-184
-
-
Cox, S.1
Harvey, R.2
Lan, Y.3
Newman, J.4
Theobald, B.5
-
6
-
-
56449085852
-
-
Technical report, University of California at Santa Cruz, Santa Cruz, CA, USA
-
Y. Freund and D. Haussler. Unsupervised learning of distributions on binary vectors using two layer networks. Technical report, University of California at Santa Cruz, Santa Cruz, CA, USA, 1994.
-
(1994)
Unsupervised Learning of Distributions on Binary Vectors Using Two Layer Networks
-
-
Freund, Y.1
Haussler, D.2
-
8
-
-
70450273199
-
Information theoretic feature extraction for audio-visual speech recognition
-
M. Gurban and J.-P. Thiran. Information theoretic feature extraction for audio-visual speech recognition. IEEE Transactions on Signal Processing, 57(12):4765-4776, 2009.
-
(2009)
IEEE Transactions on Signal Processing
, vol.57
, Issue.12
, pp. 4765-4776
-
-
Gurban, M.1
Thiran, J.-P.2
-
9
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
G. Hinton and R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504-507, 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.1
Salakhutdinov, R.2
-
10
-
-
0013344078
-
Training products of experts by minimizing contrastive divergence
-
G. E. Hinton. Training products of experts by minimizing contrastive divergence. Neural Computation, 14(8):1711-1800, 2002.
-
(2002)
Neural Computation
, vol.14
, Issue.8
, pp. 1711-1800
-
-
Hinton, G.E.1
-
11
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G. E. Hinton, S. Osindero, and Y. W. Teh. A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.W.3
-
12
-
-
84890466217
-
Improving neural networks by preventing co-adaptation of feature detectors
-
abs/1207.0580
-
G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. CoRR, abs/1207.0580, 2012.
-
(2012)
CoRR
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
15
-
-
71149119164
-
Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
-
H. Lee, R. Grosse, R. Ranganath, and A. Y. Ng. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proceedings of the 26th International Conference on Machine Learning, pages 609-616, 2009.
-
(2009)
Proceedings of the 26th International Conference on Machine Learning
, pp. 609-616
-
-
Lee, H.1
Grosse, R.2
Ranganath, R.3
Ng, A.Y.4
-
16
-
-
84890504350
-
Patch-based representation of visual speech
-
VisHCI '06, Australian Computer Society, Inc.
-
P. Lucey and S. Sridharan. Patch-based representation of visual speech. In Proceedings of the HCSNet workshop on Use of vision in human-computer interaction - Volume 56, VisHCI '06, pages 79-85. Australian Computer Society, Inc., 2006.
-
(2006)
Proceedings of the HCSNet Workshop on Use of Vision in Human-computer Interaction
, vol.56
, pp. 79-85
-
-
Lucey, P.1
Sridharan, S.2
-
17
-
-
0035365392
-
Color and texture descriptors
-
B. Manjunath, J.-R. Ohm, V. Vasudevan, and A. Yamada. Color and texture descriptors. Circuits and Systems for Video Technology, IEEE Transactions on, 11(6):703-715, 2001.
-
(2001)
Circuits and Systems for Video Technology, IEEE Transactions on
, vol.11
, Issue.6
, pp. 703-715
-
-
Manjunath, B.1
Ohm, J.-R.2
Vasudevan, V.3
Yamada, A.4
-
18
-
-
0036472941
-
Extraction of visual features for lipreading
-
Feb.
-
I. Matthews, T. F. Cootes, J. A. Bangham, S. Cox, and R. Harvey. Extraction of visual features for lipreading. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2):198-213, Feb. 2002.
-
(2002)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.24
, Issue.2
, pp. 198-213
-
-
Matthews, I.1
Cootes, T.F.2
Bangham, J.A.3
Cox, S.4
Harvey, R.5
-
19
-
-
80051624332
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton. Acoustic modeling using deep belief networks. IEEE Transactions on Audio, Speech, and Language Processing, 2011.
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
20
-
-
80053437179
-
Multimodal deep learning
-
J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning, 2011.
-
Proceedings of the 28th International Conference on Machine Learning, 2011
-
-
Ngiam, J.1
Khosla, A.2
Kim, M.3
Nam, J.4
Lee, H.5
Ng, A.Y.6
-
21
-
-
0035328421
-
Modeling the shape of the scene: A holistic representation of the spatial envelope
-
A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42:145-175, 2001.
-
(2001)
International Journal of Computer Vision
, vol.42
, pp. 145-175
-
-
Oliva, A.1
Torralba, A.2
-
22
-
-
0029938380
-
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
-
B. A. Olshausen and D. J. Field. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature, 381(6583):607-609, 1996.
-
(1996)
Nature
, vol.381
, Issue.6583
, pp. 607-609
-
-
Olshausen, B.A.1
Field, D.J.2
-
23
-
-
48149100910
-
Multimodal fusion and learning with uncertain features applied to audiovisual speech recognition
-
G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos. Multimodal fusion and learning with uncertain features applied to audiovisual speech recognition. In IEEE 9th Workshop on Multimedia Signal Processing, pages 264-267, 2007.
-
(2007)
IEEE 9th Workshop on Multimedia Signal Processing
, pp. 264-267
-
-
Papandreou, G.1
Katsamanis, A.2
Pitsikalis, V.3
Maragos, P.4
-
24
-
-
69849103259
-
Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition
-
G. Papandreou, A. Katsamanis, V. Pitsikalis, and P. Maragos. Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(3):423-435, 2009.
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, Issue.3
, pp. 423-435
-
-
Papandreou, G.1
Katsamanis, A.2
Pitsikalis, V.3
Maragos, P.4
-
25
-
-
0036299249
-
Cuave: A new audio-visual database for multimodal human-computer interface research
-
E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy. Cuave: A new audio-visual database for multimodal human-computer interface research. In International Conference on Audio Speech and Signal Processing, pages 2017-2020, 2002.
-
(2002)
International Conference on Audio Speech and Signal Processing
, pp. 2017-2020
-
-
Patterson, E.K.1
Gurbuz, S.2
Tufekci, Z.3
Gowdy, J.N.4
-
26
-
-
0000016172
-
A stochastic approximation method
-
H. Robbins and S. Monro. A stochastic approximation method. Ann. Math. Stat., 22: 400-407, 1951.
-
(1951)
Ann. Math. Stat.
, vol.22
, pp. 400-407
-
-
Robbins, H.1
Monro, S.2
-
29
-
-
0000329993
-
Information processing in dynamical systems: Foundations of harmony theory
-
MIT Press, Cambridge, MA, USA
-
P. Smolensky. Information processing in dynamical systems: Foundations of harmony theory. In Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Vol. 1, pages 194-281. MIT Press, Cambridge, MA, USA, 1986.
-
(1986)
Parallel Distributed Processing: Explorations in the Microstructure of Cognition
, vol.1
, pp. 194-281
-
-
Smolensky, P.1
-
30
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15:1929-1958, 2014.
-
(2014)
Journal of Machine Learning Research
, vol.15
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
34
-
-
77952381653
-
Image annotation with TagProp on the MIRFLICKR set
-
J. Verbeek, M. Guillaumin, T. Mensink, and C. Schmid. Image annotation with TagProp on the MIRFLICKR set. In 11th ACM International Conference on Multimedia Information Retrieval, pages 537-546, 2010.
-
(2010)
11th ACM International Conference on Multimedia Information Retrieval
, pp. 537-546
-
-
Verbeek, J.1
Guillaumin, M.2
Mensink, T.3
Schmid, C.4
-
35
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol. Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th International Conference on Machine Learning, pages 1096-1103, 2008.
-
(2008)
Proceedings of the 25th International Conference on Machine Learning
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.-A.4
-
38
-
-
0000355193
-
Parameter inference for imperfectly observed Gibbsian fields
-
L. Younes. Parameter inference for imperfectly observed Gibbsian fields. Probability Theory Rel. Fields, 82:625-645, 1989.
-
(1989)
Probability Theory Rel. Fields
, vol.82
, pp. 625-645
-
-
Younes, L.1
-
39
-
-
33644756784
-
On the convergence of Markovian stochastic algorithms with rapidly decreasing ergodicity rates
-
L. Younes. On the convergence of Markovian stochastic algorithms with rapidly decreasing ergodicity rates. In Stochastics and Stochastics Models, pages 177-228, 1998.
-
(1998)
Stochastics and Stochastics Models
, pp. 177-228
-
-
Younes, L.1
-
41
-
-
70350278777
-
Lipreading with local spatiotemporal descriptors
-
G. Zhao, M. Barnard, and M. Pietikainen. Lipreading with local spatiotemporal descriptors. IEEE Transactions on Multimedia, 11(7):1254-1265, 2009.
-
(2009)
IEEE Transactions on Multimedia
, vol.11
, Issue.7
, pp. 1254-1265
-
-
Zhao, G.1
Barnard, M.2
Pietikainen, M.3
|