-
1
-
-
80052870289
-
Probabilistic event logic for interval-based event recognition
-
June
-
W. Brendel, A. Fern, and S. Todorovic. Probabilistic event logic for interval-based event recognition. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 3329-3336, June 2011
-
(2011)
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
, pp. 3329-3336
-
-
Brendel, W.1
Fern, A.2
Todorovic, S.3
-
2
-
-
84867846313
-
Detecting actions, poses, and objects with relational phraselets
-
Springer
-
C. Desai and D. Ramanan. Detecting actions, poses, and objects with relational phraselets. In Computer Vision-ECCV 2012, pages 158-172. Springer, 2012
-
(2012)
Computer Vision-ECCV 2012
, pp. 158-172
-
-
Desai, C.1
Ramanan, D.2
-
3
-
-
84903622275
-
Fast feature pyramids for object detection
-
Aug
-
P. Dollar, R. Appel, S. Belongie, and P. Perona. Fast feature pyramids for object detection. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 36(8):1532-1545, Aug 2014
-
(2014)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.36
, Issue.8
, pp. 1532-1545
-
-
Dollar, P.1
Appel, R.2
Belongie, S.3
Perona, P.4
-
4
-
-
33846622081
-
Behavior recognition via sparse spatio-temporal features
-
IEEE
-
P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005. 2nd Joint IEEE International Workshop on, pages 65-72. IEEE, 2005
-
(2005)
Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005. 2nd Joint IEEE International Workshop on
, pp. 65-72
-
-
Dollár, P.1
Rabaud, V.2
Cottrell, G.3
Belongie, S.4
-
5
-
-
84865579385
-
Visual event recognition in videos by learning from web data
-
Sept
-
L. Duan, D. Xu, I.-H. Tsang, and J. Luo. Visual event recognition in videos by learning from web data. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 34(9):1667-1680, Sept 2012
-
(2012)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.34
, Issue.9
, pp. 1667-1680
-
-
Duan, L.1
Xu, D.2
Tsang, I.-H.3
Luo, J.4
-
6
-
-
84876258641
-
Learning hierarchical features for scene labeling
-
C. Farabet, C. Couprie, L. Najman, and Y. LeCun. Learning hierarchical features for scene labeling. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(8):1915-1929, 2013
-
(2013)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.35
, Issue.8
, pp. 1915-1929
-
-
Farabet, C.1
Couprie, C.2
Najman, L.3
LeCun, Y.4
-
7
-
-
84911400494
-
Rich feature hierarchies for accurate object detection and semantic segmentation
-
June
-
R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 580-587, June 2014
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 580-587
-
-
Girshick, R.1
Donahue, J.2
Darrell, T.3
Malik, J.4
-
8
-
-
84906344142
-
Learning rich features from rgb-d images for object detection and segmentation
-
Springer International Publishing
-
S. Gupta, R. Girshick, P. Arbelez, and J. Malik. Learning rich features from rgb-d images for object detection and segmentation. In Computer Vision ECCV 2014, volume 8695 of Lecture Notes in Computer Science, pages 345-360. Springer International Publishing, 2014
-
(2014)
Computer Vision ECCV 2014, Volume 8695 of Lecture Notes in Computer Science
, pp. 345-360
-
-
Gupta, S.1
Girshick, R.2
Arbelez, P.3
Malik, J.4
-
9
-
-
33746649771
-
Semantic analysis of soccer video using dynamic Bayesian network
-
C.-L. Huang, H.-C. Shih, and C.-Y. Chao. Semantic analysis of soccer video using dynamic Bayesian network. Multimedia, IEEE Transactions on, 8(4):749-760, 2006
-
(2006)
Multimedia, IEEE Transactions on
, vol.8
, Issue.4
, pp. 749-760
-
-
Huang, C.-L.1
Shih, H.-C.2
Chao, C.-Y.3
-
10
-
-
77957969222
-
Recognizing actions from still images
-
IEEE
-
N. Ikizler, R. G. Cinbis, S. Pehlivan, and P. Duygulu. Recognizing actions from still images. In Pattern Recognition, 2008. ICPR 2008. 19th International Conference on, pages 1-4. IEEE, 2008
-
(2008)
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
, pp. 1-4
-
-
Ikizler, N.1
Cinbis, R.G.2
Pehlivan, S.3
Duygulu, P.4
-
11
-
-
84913555165
-
-
arXiv preprint arXiv:1408. 5093
-
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408. 5093, 2014
-
(2014)
Caffe: Convolutional Architecture for Fast Feature Embedding
-
-
Jia, Y.1
Shelhamer, E.2
Donahue, J.3
Karayev, S.4
Long, J.5
Girshick, R.6
Guadarrama, S.7
Darrell, T.8
-
12
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
Curran Associates, Inc.
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 25, pages 1097-1105. Curran Associates, Inc., 2012
-
(2012)
Advances in Neural Information Processing Systems
, vol.25
, pp. 1097-1105
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
13
-
-
33845572523
-
Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
-
IEEE
-
S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, volume 2, pages 2169-2178. IEEE, 2006
-
(2006)
Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on
, vol.2
, pp. 2169-2178
-
-
Lazebnik, S.1
Schmid, C.2
Ponce, J.3
-
16
-
-
70450219021
-
Towards total scene understanding: Classification, annotation and segmentation in an automatic framework
-
June
-
L.-J. Li, R. Socher, and L. Fei-Fei. Towards total scene understanding: Classification, annotation and segmentation in an automatic framework. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 2036-2043, June 2009
-
(2009)
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
, pp. 2036-2043
-
-
Li, L.-J.1
Socher, R.2
Fei-Fei, L.3
-
17
-
-
85162513516
-
Object bank: A high-level image representation for scene classification & semantic feature sparsification
-
L.-J. Li, H. Su, L. Fei-Fei, and E. P. Xing. Object bank: A high-level image representation for scene classification & semantic feature sparsification. In Advances in neural information processing systems, pages 1378-1386, 2010
-
(2010)
Advances in Neural Information Processing Systems
, pp. 1378-1386
-
-
Li, L.-J.1
Su, H.2
Fei-Fei, L.3
Xing, E.P.4
-
18
-
-
70350581485
-
Exploiting multi-modal interactions: A unified framework
-
M. Li, X.-B. Xue, and Z.-H. Zhou. Exploiting multi-modal interactions: A unified framework. In IJCAI, pages 1120-1125, 2009
-
(2009)
IJCAI
, pp. 1120-1125
-
-
Li, M.1
Xue, X.-B.2
Zhou, Z.-H.3
-
19
-
-
84906486177
-
Exploiting privileged information from web data for image categorization
-
Springer International Publishing
-
W. Li, L. Niu, and D. Xu. Exploiting privileged information from web data for image categorization. In Computer Vision ECCV 2014, volume 8693 of Lecture Notes in Computer Science, pages 437-452. Springer International Publishing, 2014
-
(2014)
Computer Vision ECCV 2014, Volume 8693 of Lecture Notes in Computer Science
, pp. 437-452
-
-
Li, W.1
Niu, L.2
Xu, D.3
-
20
-
-
84898770979
-
Pedestrian parsing via deep decompositional network
-
IEEE
-
P. Luo, X. Wang, and X. Tang. Pedestrian parsing via deep decompositional network. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 2648-2655. IEEE, 2013
-
(2013)
Computer Vision (ICCV), 2013 IEEE International Conference on
, pp. 2648-2655
-
-
Luo, P.1
Wang, X.2
Tang, X.3
-
21
-
-
80052880806
-
Action recognition from a distributed representation of pose and appearance
-
IEEE
-
S. Maji, L. Bourdev, and J. Malik. Action recognition from a distributed representation of pose and appearance. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 3177-3184. IEEE, 2011
-
(2011)
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
, pp. 3177-3184
-
-
Maji, S.1
Bourdev, L.2
Malik, J.3
-
22
-
-
33747626730
-
Large-scale concept ontology for multimedia
-
M. Naphade, J. R. Smith, J. Tesic, S.-F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis. Large-scale concept ontology for multimedia. MultiMedia, IEEE, 13(3):86-91, 2006
-
(2006)
MultiMedia, IEEE
, vol.13
, Issue.3
, pp. 86-91
-
-
Naphade, M.1
Smith, J.R.2
Tesic, J.3
Chang, S.-F.4
Hsu, W.5
Kennedy, L.6
Hauptmann, A.7
Curtis, J.8
-
23
-
-
80053437179
-
Multimodal deep learning
-
J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning (ICML-11), pages 689-696, 2011
-
(2011)
Proceedings of the 28th International Conference on Machine Learning (ICML-11)
, pp. 689-696
-
-
Ngiam, J.1
Khosla, A.2
Kim, M.3
Nam, J.4
Lee, H.5
Ng, A.Y.6
-
24
-
-
0035328421
-
Modeling the shape of the scene: A holistic representation of the spatial envelope
-
A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International journal of computer vision, 42(3):145-175, 2001
-
(2001)
International Journal of Computer Vision
, vol.42
, Issue.3
, pp. 145-175
-
-
Oliva, A.1
Torralba, A.2
-
25
-
-
85077311325
-
Trecvid 2014-an overview of the goals, tasks, data, evaluation mechanisms and metrics
-
USA
-
P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, W. Kraaij, A. F. Smeaton, and G. Quenot. Trecvid 2014-an overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2014. NIST, USA, 2014
-
(2014)
Proceedings of TRECVID 2014. NIST
-
-
Over, P.1
Awad, G.2
Michel, M.3
Fiscus, J.4
Sanders, G.5
Kraaij, W.6
Smeaton, A.F.7
Quenot, G.8
-
26
-
-
84856650974
-
Scene recognition and weakly supervised object localization with deformable part-based models
-
Nov
-
M. Pandey and S. Lazebnik. Scene recognition and weakly supervised object localization with deformable part-based models. In Computer Vision (ICCV), 2011 IEEE International Conference on, pages 1307-1314, Nov 2011
-
(2011)
Computer Vision (ICCV), 2011 IEEE International Conference on
, pp. 1307-1314
-
-
Pandey, M.1
Lazebnik, S.2
-
27
-
-
78149349613
-
Tracklet descriptors for action modeling and video analysis
-
Springer
-
M. Raptis and S. Soatto. Tracklet descriptors for action modeling and video analysis. In Computer Vision-ECCV 2010, pages 577-590. Springer, 2010
-
(2010)
Computer Vision-ECCV 2010
, pp. 577-590
-
-
Raptis, M.1
Soatto, S.2
-
28
-
-
84909978410
-
-
CoRR, abs/1409. 0575
-
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. S. Bernstein, A. C. Berg, and L. Fei-Fei. Imagenet large scale visual recognition challenge. CoRR, abs/1409. 0575, 2014
-
(2014)
Imagenet Large Scale Visual Recognition Challenge
-
-
Russakovsky, O.1
Deng, J.2
Su, H.3
Krause, J.4
Satheesh, S.5
Ma, S.6
Huang, Z.7
Karpathy, A.8
Khosla, A.9
Bernstein, M.S.10
Berg, A.C.11
Fei-Fei, L.12
-
32
-
-
84866658784
-
Learning latent temporal structure for complex event detection
-
June
-
K. Tang, L. Fei-Fei, and D. Koller. Learning latent temporal structure for complex event detection. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 1250-1257, June 2012
-
(2012)
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
, pp. 1250-1257
-
-
Tang, K.1
Fei-Fei, L.2
Koller, D.3
-
33
-
-
84881160857
-
Selective search for object recognition
-
J. R. Uijlings, K. E. van de Sande, T. Gevers, and A. W. Smeulders. Selective search for object recognition. International journal of computer vision, 104(2):154-171, 2013
-
(2013)
International Journal of Computer Vision
, vol.104
, Issue.2
, pp. 154-171
-
-
Uijlings, J.R.1
Sande De Van, K.E.2
Gevers, T.3
Smeulders, A.W.4
-
34
-
-
84898890371
-
Evaluation of local spatio-temporal features for action recognition
-
H. Wang, M. M. Ullah, A. Klaser, I. Laptev, C. Schmid, et al. Evaluation of local spatio-temporal features for action recognition. In BMVC 2009-British Machine Vision Conference, 2009
-
(2009)
BMVC 2009-British Machine Vision Conference
-
-
Wang, H.1
Ullah, M.M.2
Klaser, A.3
Laptev, I.4
Schmid, C.5
-
35
-
-
84911434661
-
Zero-shot event detection using multi-modal fusion of weakly supervised concepts
-
IEEE
-
S. Wu, S. Bondugula, F. Luisier, X. Zhuang, and P. Natarajan. Zero-shot event detection using multi-modal fusion of weakly supervised concepts. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2665-2672. IEEE, 2014
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 2665-2672
-
-
Wu, S.1
Bondugula, S.2
Luisier, F.3
Zhuang, X.4
Natarajan, P.5
-
36
-
-
77955988947
-
Sun database: Large-scale scene recognition from abbey to zoo
-
June
-
J. Xiao, J. Hays, K. Ehinger, A. Oliva, and A. Torralba. Sun database: Large-scale scene recognition from abbey to zoo. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3485-3492, June 2010
-
(2010)
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
, pp. 3485-3492
-
-
Xiao, J.1
Hays, J.2
Ehinger, K.3
Oliva, A.4
Torralba, A.5
-
37
-
-
85060905667
-
Recognizing human action in time-sequential images using hidden markov model
-
IEEE
-
J. Yamato, J. Ohya, and K. Ishii. Recognizing human action in time-sequential images using hidden markov model. In Computer Vision and Pattern Recognition, 1992. Proceedings CVPR'92., 1992 IEEE Computer Society Conference on, pages 379-385. IEEE, 1992
-
(1992)
Computer Vision and Pattern Recognition, 1992. Proceedings CVPR'92., 1992 IEEE Computer Society Conference on
, pp. 379-385
-
-
Yamato, J.1
Ohya, J.2
Ishii, K.3
-
38
-
-
77955996308
-
Recognizing human actions from still images with latent poses
-
IEEE
-
W. Yang, Y. Wang, and G. Mori. Recognizing human actions from still images with latent poses. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 2030-2037. IEEE, 2010
-
(2010)
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
, pp. 2030-2037
-
-
Yang, W.1
Wang, Y.2
Mori, G.3
-
39
-
-
84867886443
-
Complex events detection using datadriven concepts
-
Springer Berlin Heidelberg
-
Y. Yang and M. Shah. Complex events detection using datadriven concepts. In Computer Vision ECCV 2012, volume 7574, pages 722-735. Springer Berlin Heidelberg, 2012
-
(2012)
Computer Vision ECCV 2012
, vol.7574
, pp. 722-735
-
-
Yang, Y.1
Shah, M.2
-
40
-
-
84865593256
-
Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses
-
B. Yao and L. Fei-Fei. Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 34(9):1691-1703, 2012
-
(2012)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.34
, Issue.9
, pp. 1691-1703
-
-
Yao, B.1
Fei-Fei, L.2
-
41
-
-
84906489617
-
Edge boxes: Locating object proposals from edges
-
Springer International Publishing
-
C. Zitnick and P. Dollr. Edge boxes: Locating object proposals from edges. In Computer Vision ECCV 2014, volume 8693, pages 391-405. Springer International Publishing, 2014.
-
(2014)
Computer Vision ECCV 2014
, vol.8693
, pp. 391-405
-
-
Zitnick, C.1
Dollr, P.2
|