-
2
-
-
33745891801
-
Actions as space-time shapes
-
IEEE
-
M. Blank, L. Gorelick, E. Shechtman, M. Irani, and R. Basri. Actions as space-time shapes. In Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on, volume 2, pages 1395-1402. IEEE, 2005.
-
(2005)
Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on
, vol.2
, pp. 1395-1402
-
-
Blank, M.1
Gorelick, L.2
Shechtman, E.3
Irani, M.4
Basri, R.5
-
3
-
-
84959216468
-
ActivityNet: A large-scale video benchmark for human activity understanding
-
F. Caba Heilbron, V. Escorcia, B. Ghanem, and J. Carlos Niebles. ActivityNet: A large-scale video benchmark for human activity understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 961-970, 2015.
-
(2015)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
, pp. 961-970
-
-
Caba Heilbron, F.1
Escorcia, V.2
Ghanem, B.3
Carlos Niebles, J.4
-
4
-
-
85009912425
-
-
arXiv preprint arXiv:1411.4389
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411.4389, 2014.
-
(2014)
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
5
-
-
84911443425
-
Scalable object detection using deep neural networks
-
IEEE
-
D. Erhan, C. Szegedy, A. Toshev, and D. Anguelov. Scalable object detection using deep neural networks. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2155-2162. IEEE, 2014.
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 2155-2162
-
-
Erhan, D.1
Szegedy, C.2
Toshev, A.3
Anguelov, D.4
-
6
-
-
84955316677
-
-
arXiv preprint arXiv:1504.08083
-
R. Girshick. Fast R-CNN. arXiv preprint arXiv:1504.08083, 2015.
-
(2015)
Fast R-CNN
-
-
Girshick, R.1
-
8
-
-
70450202741
-
Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos
-
IEEE
-
A. Gupta, P. Srinivasan, J. Shi, and L. S. Davis. Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 2012-2019. IEEE, 2009.
-
(2009)
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
, pp. 2012-2019
-
-
Gupta, A.1
Srinivasan, P.2
Shi, J.3
Davis, L.S.4
-
9
-
-
84911453664
-
Action localization with tubelets from motion
-
IEEE
-
M. Jain, J. Van Gemert, H. Jégou, P. Bouthemy, and C. G. Snoek. Action localization with tubelets from motion. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 740-747. IEEE, 2014.
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 740-747
-
-
Jain, M.1
Van Gemert, J.2
Jégou, H.3
Bouthemy, P.4
Snoek, C.G.5
-
10
-
-
84898819791
-
Towards understanding action recognition
-
IEEE
-
H. Jhuang, J. Gall, S. Zuffi, C. Schmid, and M. J. Black. Towards understanding action recognition. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 3192-3199. IEEE, 2013.
-
(2013)
Computer Vision (ICCV), 2013 IEEE International Conference on
, pp. 3192-3199
-
-
Jhuang, H.1
Gall, J.2
Zuffi, S.3
Schmid, C.4
Black, M.J.5
-
11
-
-
84905052261
-
-
Y.-G. Jiang, J. Liu, A. Roshan Zamir, G. Toderici, I. Laptev, M. Shah, and R. Sukthankar. THUMOS challenge: Action recognition with a large number of classes. http://crcv.ucf.edu/THUMOS14/, 2014.
-
(2014)
THUMOS Challenge: Action Recognition with A Large Number of Classes
-
-
Jiang, Y.-G.1
Liu, J.2
Roshan Zamir, A.3
Toderici, G.4
Laptev, I.5
Shah, M.6
Sukthankar, R.7
-
12
-
-
84911441074
-
Efficient feature extraction, encoding, and classification for action recognition
-
IEEE
-
V. Kantorov and I. Laptev. Efficient feature extraction, encoding, and classification for action recognition. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2593-2600. IEEE, 2014.
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 2593-2600
-
-
Kantorov, V.1
Laptev, I.2
-
15
-
-
84887386994
-
Multi-agent event detection: Localization and role assignment
-
S. Kwak, B. Han, and J. H. Han. Multi-agent event detection: Localization and role assignment. In CVPR, 2013.
-
(2013)
CVPR
-
-
Kwak, S.1
Han, B.2
Han, J.H.3
-
16
-
-
84863083227
-
Discriminative figure-centric models for joint action localization and recognition
-
IEEE
-
T. Lan, Y. Wang, and G. Mori. Discriminative figure-centric models for joint action localization and recognition. In Computer Vision (ICCV), 2011 IEEE International Conference on, pages 2003-2010. IEEE, 2011.
-
(2011)
Computer Vision (ICCV), 2011 IEEE International Conference on
, pp. 2003-2010
-
-
Lan, T.1
Wang, Y.2
Mori, G.3
-
17
-
-
51949083365
-
Learning realistic human actions from movies
-
IEEE
-
I. Laptev, M. Marszałek, C. Schmid, and B. Rozenfeld. Learning realistic human actions from movies. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1-8. IEEE, 2008.
-
(2008)
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
, pp. 1-8
-
-
Laptev, I.1
Marszałek, M.2
Schmid, C.3
Rozenfeld, B.4
-
18
-
-
84865583235
-
Incremental activity modeling in multiple disjoint cameras
-
C. C. Loy, T. Xiang, and S. Gong. Incremental activity modeling in multiple disjoint cameras. TPAMI, 34(9):1799-1813, 2012.
-
(2012)
TPAMI
, vol.34
, Issue.9
, pp. 1799-1813
-
-
Loy, C.C.1
Xiang, T.2
Gong, S.3
-
21
-
-
84911397627
-
Multiple granularity analysis for fine-grained action detection
-
IEEE
-
B. Ni, V. R. Paramathayalan, and P. Moulin. Multiple granularity analysis for fine-grained action detection. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 756-763. IEEE, 2014.
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 756-763
-
-
Ni, B.1
Paramathayalan, V.R.2
Moulin, P.3
-
24
-
-
77949275097
-
A survey on vision-based human action recognition
-
R. Poppe. A survey on vision-based human action recognition. IVC, 28:976-990, 2010.
-
(2010)
IVC
, vol.28
, pp. 976-990
-
-
Poppe, R.1
-
25
-
-
84961917629
-
-
arXiv preprint arXiv:1506.02640
-
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. You only look once: Unified, real-time object detection. arXiv preprint arXiv:1506.02640, 2015.
-
(2015)
You only Look Once: Unified, Real-time Object Detection
-
-
Redmon, J.1
Divvala, S.2
Girshick, R.3
Farhadi, A.4
-
27
-
-
84866710901
-
A database for fine grained activity detection of cooking activities
-
IEEE
-
M. Rohrbach, S. Amin, M. Andriluka, and B. Schiele. A database for fine grained activity detection of cooking activities. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 1194-1201. IEEE, 2012.
-
(2012)
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
, pp. 1194-1201
-
-
Rohrbach, M.1
Amin, S.2
Andriluka, M.3
Schiele, B.4
-
28
-
-
84945944033
-
ImageNet large scale visual recognition challenge
-
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, et al. ImageNet large scale visual recognition challenge. International Journal of Computer Vision, pages 1-42, 2014.
-
(2014)
International Journal of Computer Vision
, pp. 1-42
-
-
Russakovsky, O.1
Deng, J.2
Su, H.3
Krause, J.4
Satheesh, S.5
Ma, S.6
Huang, Z.7
Karpathy, A.8
Khosla, A.9
Bernstein, M.10
-
29
-
-
84906347546
-
-
arXiv preprint arXiv:1312.6229
-
P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229, 2013.
-
(2013)
Overfeat: Integrated Recognition, Localization and Detection Using Convolutional Networks
-
-
Sermanet, P.1
Eigen, D.2
Zhang, X.3
Mathieu, M.4
Fergus, R.5
LeCun, Y.6
-
31
-
-
33845574026
-
Learning temporal sequence model from partially labeled data
-
IEEE
-
Y. Shi, A. Bobick, and I. Essa. Learning temporal sequence model from partially labeled data. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, volume 2, pages 1631-1638. IEEE, 2006.
-
(2006)
Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on
, vol.2
, pp. 1631-1638
-
-
Shi, Y.1
Bobick, A.2
Essa, I.3
-
32
-
-
84959200790
-
Joint inference of groups, events and human roles in aerial videos
-
T. Shu, D. Xie, B. Rothrock, S. Todorovic, and S.-C. Zhu. Joint inference of groups, events and human roles in aerial videos. In CVPR, 2015.
-
(2015)
CVPR
-
-
Shu, T.1
Xie, D.2
Rothrock, B.3
Todorovic, S.4
Zhu, S.-C.5
-
35
-
-
84962336509
-
-
arXiv preprint arXiv:1412.1441
-
C. Szegedy, S. Reed, D. Erhan, and D. Anguelov. Scalable, high-quality object detection. arXiv preprint arXiv:1412.1441, 2014.
-
(2014)
Scalable, High-quality Object Detection
-
-
Szegedy, C.1
Reed, S.2
Erhan, D.3
Anguelov, D.4
-
36
-
-
84887356306
-
Spatiotemporal deformable part models for action detection
-
IEEE
-
Y. Tian, R. Sukthankar, and M. Shah. Spatiotemporal deformable part models for action detection. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 2642-2649. IEEE, 2013.
-
(2013)
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
, pp. 2642-2649
-
-
Tian, Y.1
Sukthankar, R.2
Shah, M.3
-
40
-
-
78751648503
-
A survey of vision-based methods for action representation, segmentation and recognition
-
D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. In Computer Vision and Image Understanding, Vol. 115, Issue 2, pp. 224-241, 2010.
-
(2010)
Computer Vision and Image Understanding
, vol.115
, Issue.2
, pp. 224-241
-
-
Weinland, D.1
Ronfard, R.2
Boyer, E.3
-
42
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229-256, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 229-256
-
-
Williams, R.J.1
-
43
-
-
85009857480
-
-
arXiv preprint arXiv:1502.03044
-
K. Xu, J. Ba, R. Kiros, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio. Show, attend and tell: Neural image caption generation with visual attention. arXiv preprint arXiv:1502.03044, 2015.
-
(2015)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Courville, A.4
Salakhutdinov, R.5
Zemel, R.6
Bengio, Y.7
-
44
-
-
77955995201
-
A Hough transform-based voting framework for action recognition
-
IEEE
-
A. Yao, J. Gall, and L. Van Gool. A Hough transform-based voting framework for action recognition. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 2061-2068. IEEE, 2010.
-
(2010)
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
, pp. 2061-2068
-
-
Yao, A.1
Gall, J.2
Van Gool, L.3
-
48
-
-
84973898486
-
-
arXiv preprint arXiv:1503.04144
-
S. Zha, F. Luisier, W. Andrews, N. Srivastava, and R. Salakhutdinov. Exploiting image-trained CNN architectures for unconstrained video classification. arXiv preprint arXiv:1503.04144, 2015.
-
(2015)
Exploiting Image-trained CNN Architectures for Unconstrained Video Classification
-
-
Zha, S.1
Luisier, F.2
Andrews, W.3
Srivastava, N.4
Salakhutdinov, R.5
-
49
-
-
5044228350
-
Detecting unusual activity in video
-
IEEE
-
H. Zhong, J. Shi, and M. Visontai. Detecting unusual activity in video. In Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on, volume 2, pages II-819. IEEE, 2004.
-
(2004)
Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on
, vol.2
, pp. II-819
-
-
Zhong, H.1
Shi, J.2
Visontai, M.3
-
50
-
-
84959314189
-
-
arXiv preprint arXiv:1506.06724
-
Y. Zhu, R. Kiros, R. Zemel, R. Salakhutdinov, R. Urtasun, A. Torralba, and S. Fidler. Aligning books and movies: Towards story-like visual explanations by watching movies and reading books. arXiv preprint arXiv:1506.06724, 2015.
-
(2015)
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books
-
-
Zhu, Y.1
Kiros, R.2
Zemel, R.3
Salakhutdinov, R.4
Urtasun, R.5
Torralba, A.6
Fidler, S.7