-
2
-
-
77955989314
-
Cross-dataset action detection
-
L. Cao, Z. Liu, and T. S. Huang. Cross-dataset action detection. In CVPR, 2010.
-
(2010)
CVPR
-
-
Cao, L.1
Liu, Z.2
Huang, T.S.3
-
3
-
-
84899755756
-
Event-driven semantic concept discovery by exploiting weakly tagged internet images
-
J. Chen, Y. Cui, G. Ye, D. Liu, and S.-F. Chang. Event-driven semantic concept discovery by exploiting weakly tagged internet images. In ICMR, 2014.
-
(2014)
ICMR
-
-
Chen, J.1
Cui, Y.2
Ye, G.3
Liu, D.4
Chang, S.-F.5
-
4
-
-
84959216393
-
Textual similarity with a bag-ofembedded-words model
-
S. Clinchant and F. Perronnin. Textual similarity with a bag-ofembedded-words model. In ICTIR, 2013.
-
(2013)
ICTIR
-
-
Clinchant, S.1
Perronnin, F.2
-
5
-
-
85198028989
-
Imagenet: A large-scale hierarchical image database
-
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, 2009.
-
(2009)
CVPR
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
6
-
-
84898803425
-
Write a classifier: Zeroshot learning using purely textual descriptions
-
M. Elhoseiny, B. Saleh, and A. Elgammal. Write a classifier: Zeroshot learning using purely textual descriptions. In ICCV, 2013.
-
(2013)
ICCV
-
-
Elhoseiny, M.1
Saleh, B.2
Elgammal, A.3
-
7
-
-
84900116262
-
Evaluation of color spatio-temporal interest points for human action recognition
-
I. Everts, J. C. van Gemert, and T. Gevers. Evaluation of color spatio-temporal interest points for human action recognition. TIP, 23(4):1569-1580, 2014.
-
(2014)
TIP
, vol.23
, Issue.4
, pp. 1569-1580
-
-
Everts, I.1
Van Gemert, J.C.2
Gevers, T.3
-
9
-
-
84898958665
-
Devise: A deep visual-semantic embedding model
-
A. FRome, G. Corrado, J. Shlens, S. Bengio, J. Dean, M. Ranzato, and T. Mikolov. Devise: A deep visual-semantic embedding model. In NIPS, 2013.
-
(2013)
NIPS
-
-
Rome, F.A.1
Corrado, G.2
Shlens, J.3
Bengio, S.4
Dean, J.5
Ranzato, M.6
Mikolov, T.7
-
10
-
-
84898773262
-
Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
-
S. Guadarrama, N. Krishnamoorthy, G. Malkarnenkar, S. Venugopalan, R. Mooney, T. Darrell, and K. Saenko. Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In ICCV, 2013.
-
(2013)
ICCV
-
-
Guadarrama, S.1
Krishnamoorthy, N.2
Malkarnenkar, G.3
Venugopalan, S.4
Mooney, R.5
Darrell, T.6
Saenko, K.7
-
11
-
-
84899708619
-
Composite concept discovery for zero-shot video event detection
-
A. Habibian, T. Mensink, and C. Snoek. Composite concept discovery for zero-shot video event detection. In ICMR, 2014.
-
(2014)
ICMR
-
-
Habibian, A.1
Mensink, T.2
Snoek, C.3
-
12
-
-
84887398298
-
Better exploiting motion for better action recognition
-
M. Jain, H. Jégou, and P. Bouthemy. Better exploiting motion for better action recognition. In CVPR, 2013.
-
(2013)
CVPR
-
-
Jain, M.1
Jégou, H.2
Bouthemy, P.3
-
13
-
-
84911453664
-
Action localization with tubelets from motion
-
M. Jain, J. van Gemert, H. Jégou, P. Bouthemy, and C. Snoek. Action localization with tubelets from motion. In CVPR, 2014.
-
(2014)
CVPR
-
-
Jain, M.1
Van Gemert, J.2
Jégou, H.3
Bouthemy, P.4
Snoek, C.5
-
14
-
-
84959235126
-
What do 15, 000 object categories tell us about classifying and localizing actions? in
-
M. Jain, J. van Gemert, and C. Snoek. What do 15, 000 object categories tell us about classifying and localizing actions? In CVPR, 2015.
-
(2015)
CVPR
-
-
Jain, M.1
Van Gemert, J.2
Snoek, C.3
-
15
-
-
84899746163
-
Zero-example event search using multimodal pseudo relevance feedback
-
L. Jiang, T. Mitamura, S. Yu, and A. Hauptmann. Zero-example event search using multimodal pseudo relevance feedback. In ICMR, 2014.
-
(2014)
ICMR
-
-
Jiang, L.1
Mitamura, T.2
Yu, S.3
Hauptmann, A.4
-
16
-
-
84905052261
-
-
Y.-G. Jiang, J. Liu, A. Roshan Zamir, G. Toderici, I. Laptev, M. Shah, and R. Sukthankar. THUMOS challenge: Action recognition with a large number of classes. http://crcv. ucf. edu/THUMOS14/, 2014.
-
(2014)
THUMOS Challenge: Action Recognition with A Large Number of Classes.
-
-
Jiang, Y.-G.1
Liu, J.2
Roshan Zamir, A.3
Toderici, G.4
Laptev, I.5
Shah, M.6
Sukthankar, R.7
-
17
-
-
84911364368
-
Large-scale video classification with convolutional neural networks
-
A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In CVPR, 2014.
-
(2014)
CVPR
-
-
Karpathy, A.1
Toderici, G.2
Shetty, S.3
Leung, T.4
Sukthankar, R.5
Fei-Fei, L.6
-
18
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
19
-
-
84856682691
-
HMDB: A large video database for human motion recognition
-
H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre. HMDB: A large video database for human motion recognition. In ICCV, 2011.
-
(2011)
ICCV
-
-
Kuehne, H.1
Jhuang, H.2
Garrote, E.3
Poggio, T.4
Serre, T.5
-
20
-
-
70450172710
-
Learning to detect unseen object classes by between-class attribute transfer
-
C. Lampert, H. Nickisch, and S. Harmeling. Learning to detect unseen object classes by between-class attribute transfer. In CVPR, 2009.
-
(2009)
CVPR
-
-
Lampert, C.1
Nickisch, H.2
Harmeling, S.3
-
21
-
-
84863083227
-
Discriminative figure-centric models for joint action localization and recognition
-
T. Lan, Y. Wang, and G. Mori. Discriminative figure-centric models for joint action localization and recognition. In ICCV, 2011.
-
(2011)
ICCV
-
-
Lan, T.1
Wang, Y.2
Mori, G.3
-
22
-
-
84919829999
-
Distributed representations of sentences and documents
-
Q. Le and T. Mikolov. Distributed representations of sentences and documents. In ICML, 2014.
-
(2014)
ICML
-
-
Le, Q.1
Mikolov, T.2
-
23
-
-
84894424062
-
Object bank: An objectlevel image representation for high-level visual recognition
-
L.-J. Li, H. Su, Y. Lim, and L. Fei-Fei. Object bank: An objectlevel image representation for high-level visual recognition. IJCV, 107(1):20-39, 2014.
-
(2014)
IJCV
, vol.107
, Issue.1
, pp. 20-39
-
-
Li, L.-J.1
Su, H.2
Lim, Y.3
Fei-Fei, L.4
-
24
-
-
80052915325
-
Recognizing human actions by attributes
-
J. Liu, B. Kuipers, and S. Savarese. Recognizing human actions by attributes. In CVPR, 2011.
-
(2011)
CVPR
-
-
Liu, J.1
Kuipers, B.2
Savarese, S.3
-
25
-
-
84911410734
-
Costa: Co-occurrence statistics for zero-shot classification
-
T. Mensink, E. Gavves, and C. Snoek. Costa: Co-occurrence statistics for zero-shot classification. In CVPR, 2014.
-
(2014)
CVPR
-
-
Mensink, T.1
Gavves, E.2
Snoek, C.3
-
26
-
-
85083951332
-
Efficient estimation of word representations in vector space
-
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. In ICLR, 2013.
-
(2013)
ICLR
-
-
Mikolov, T.1
Chen, K.2
Corrado, G.3
Dean, J.4
-
27
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
-
(2013)
NIPS
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.S.4
Dean, J.5
-
28
-
-
84926034201
-
Evaluating neural word representations in tensor-based compositional settings
-
D. Milajevs, D. Kartsaklis, M. Sadrzadeh, and M. Purver. Evaluating neural word representations in tensor-based compositional settings. In EMNLP, 2014.
-
(2014)
EMNLP
-
-
Milajevs, D.1
Kartsaklis, D.2
Sadrzadeh, M.3
Purver, M.4
-
29
-
-
85083952206
-
Zero-shot learning by convex combination of semantic embeddings
-
M. Norouzi, T. Mikolov, S. Bengio, Y. Singer, J. Shlens, A. FRome, G. Corrado, and J. Dean. Zero-shot learning by convex combination of semantic embeddings. In ICLR, 2014.
-
(2014)
ICLR
-
-
Norouzi, M.1
Mikolov, T.2
Bengio, S.3
Singer, Y.4
Shlens, J.5
Frome, A.6
Corrado, G.7
Dean, J.8
-
31
-
-
85085788280
-
Trecvid 2013-an introduction to the goals, tasks, data, evaluation mechanisms, and metrics
-
P. Over, G. Awad, J. Fiscus, and G. Sanders. Trecvid 2013-an introduction to the goals, tasks, data, evaluation mechanisms, and metrics. In TRECVID Workshop, 2013.
-
(2013)
TRECVID Workshop
-
-
Over, P.1
Awad, G.2
Fiscus, J.3
Sanders, G.4
-
33
-
-
84959242896
-
Boosting vlad with supervised dictionary learning and high-order statistics
-
X. Peng, L. Wang, Y. Qiao, and Q. Peng. Boosting vlad with supervised dictionary learning and high-order statistics. In ECCV, 2014.
-
(2014)
ECCV
-
-
Peng, X.1
Wang, L.2
Qiao, Y.3
Peng, Q.4
-
34
-
-
84947130265
-
Action recognition with stacked fisher vectors
-
X. Peng, C. Zou, Y. Qiao, and Q. Peng. Action recognition with stacked fisher vectors. In ECCV, 2014.
-
(2014)
ECCV
-
-
Peng, X.1
Zou, C.2
Qiao, Y.3
Peng, Q.4
-
35
-
-
51949084792
-
Action Mach: A spatiotemporal maximum average correlation height filter for action recognition
-
M. D. Rodriguez, J. Ahmed, and M. Shah. Action MACH: A spatiotemporal maximum average correlation height filter for action recognition. In CVPR, 2008.
-
(2008)
CVPR
-
-
Rodriguez, M.D.1
Ahmed, J.2
Shah, M.3
-
36
-
-
80052892795
-
Evaluating knowledge transfer and zero-shot learning in a large-scale setting
-
M. Rohrbach, M. Stark, and B. Schiele. Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In CVPR, 2011.
-
(2011)
CVPR
-
-
Rohrbach, M.1
Stark, M.2
Schiele, B.3
-
37
-
-
84866718894
-
Action bank: A high-level representation of activity in video
-
S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, 2012.
-
(2012)
CVPR
-
-
Sadanand, S.1
Corso, J.J.2
-
38
-
-
84883487458
-
Image classification with the fisher vector: Theory and practice
-
J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek. Image classification with the fisher vector: Theory and practice. IJCV, 2013.
-
(2013)
IJCV
-
-
Sánchez, J.1
Perronnin, F.2
Mensink, T.3
Verbeek, J.4
-
39
-
-
84937862424
-
Two-stream convolutional networks for action recognition in videos
-
K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. In NIPS, 2014.
-
(2014)
NIPS
-
-
Simonyan, K.1
Zisserman, A.2
-
40
-
-
85085787180
-
MediaMill at TRECVID 2013: Searching concepts, objects, instances and events in video
-
C. Snoek et al. MediaMill at TRECVID 2013: Searching concepts, objects, instances and events in video. In TRECVID, 2013.
-
(2013)
TRECVID
-
-
Snoek, C.1
-
42
-
-
84893702065
-
UCF101: A dataset of 101 human actions classes from videos in the wild
-
K. Soomro, A. R. Zamir, and M. Shah. UCF101: A dataset of 101 human actions classes from videos in the wild. CoRR, 2012.
-
(2012)
CoRR
-
-
Soomro, K.1
Zamir, A.R.2
Shah, M.3
-
43
-
-
84973858597
-
Semantic aware video transcription using random forest classifiers
-
C. Sun and R. Nevatia. Semantic aware video transcription using random forest classifiers. In ECCV, 2014.
-
(2014)
ECCV
-
-
Sun, C.1
Nevatia, R.2
-
44
-
-
84949572890
-
-
arXiv preprint arXiv:1503. 01817
-
B. Thomee, D. A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth, and L.-J. Li. The new data and new challenges in multimedia research. ArXiv preprint arXiv:1503. 01817, 2015.
-
(2015)
The New Data and New Challenges in Multimedia Research
-
-
Thomee, B.1
Shamma, D.A.2
Friedland, G.3
Elizalde, B.4
Ni, K.5
Poland, D.6
Borth, D.7
Li, L.-J.8
-
45
-
-
84887356306
-
Spatiotemporal deformable part models for action detection
-
Y. Tian, R. Sukthankar, and M. Shah. Spatiotemporal deformable part models for action detection. In CVPR, 2013.
-
(2013)
CVPR
-
-
Tian, Y.1
Sukthankar, R.2
Shah, M.3
-
46
-
-
84973913561
-
APT: Action localization proposals from dense trajectories
-
J. van Gemert, M. Jain, E. Gati, and C. Snoek. APT: Action localization proposals from dense trajectories. In BMVC, 2015.
-
(2015)
BMVC
-
-
Van Gemert, J.1
Jain, M.2
Gati, E.3
Snoek, C.4
-
47
-
-
84898805910
-
Action recognition with improved trajectories
-
H. Wang and C. Schmid. Action recognition with improved trajectories. In ICCV, 2013.
-
(2013)
ICCV
-
-
Wang, H.1
Schmid, C.2
-
48
-
-
84911434661
-
Zeroshot event detection using multi-modal fusion of weakly supervised concepts
-
S. Wu, S. Bondugula, F. Luisier, X. Zhuang, and P. Natarajan. Zeroshot event detection using multi-modal fusion of weakly supervised concepts. In CVPR, 2014.
-
(2014)
CVPR
-
-
Wu, S.1
Bondugula, S.2
Luisier, F.3
Zhuang, X.4
Natarajan, P.5
-
49
-
-
84959226659
-
A discriminative CNN video representation for event detection
-
Z. Xu, Y. Yang, and A. G. Hauptmann. A discriminative CNN video representation for event detection. In CVPR, 2015.
-
(2015)
CVPR
-
-
Xu, Z.1
Yang, Y.2
Hauptmann, A.G.3
|